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<210> 1 
<211> 1475 
<212> DNA 

<213> Homo sapiens 



<400> 1 

caatatagta 

ttgatatgaa 

aggggggact 

tactgagggc 

tgactataaa 

gggcaaagtg 

actcgtggtg 

tagggtgaca 

cctgctgata 

gttgcttctc 

tattctgttt 

cagagcaaag 

agcagagagc 

agtaacagca 

gttgggagca 

gttgtcatat 

cagctcttct 

aaatagcccc 

actgtaatag 

gtttcctata 

gatggataaa 

acaatgggta 

aaaaaaaaaa 

aacattcttc 

gtgtcctggg 



caataactat 

gtatatggga 

tgagtatcct 

tgactgtata 

caaaatatat 

agttatacag 

gtttcctttt 

gtaggatcta 

caggaccaaa 

ttaactgtgg 

ctttggaact 

atgtacagag 

ttaaattaca 

gtaatagtta 

acagatgcta 

tactatatag 

tagtattctg 

tcaggaaaac 

acttatattt 

tttgtaatga 

agttcacctt 

aatagaattt 

aaaaaaccaa 

tgatgccggc 

ctgagtcccc 



ttgcatgaca 

ggatgtgcaa 

ttgttaccct 

gtcctatcct 

gtaataggtg 

ctccattctt 

gtctccttta 

cccgatagta 

gcatcctgtt 

cctcctacac 

ttgaatatac 

gaaacaatgc 

gctttaaaaa 

acggagccag 

aagagtagtt 

ctactgcatg 

tgtgacccca 

taagtttttc 

ctgaacattt 

acattcttct 

ttattgtata 

ttcagattat 

caaaaataaa 

atccctgctt 

ggggaagaat 



tttacatcgg 

aggtgatgtg 

caggagatcc 

cacggaactt 

gtggtaagta 

agaaaccttg 

ttacatggga 

gggtggcagt 

gttctcgagc 

tgtgttttgg 

aacactttac 

gtaactctga 

tttttatttt 

aatgcttgag 

gctgtagttc 

aagaagagtt 

tttgaccttt 

tctgctgttt 

tagtgcttgc 

tccggtacat 

aaattgactg 

taaaagctga 

cccaaacccc 

gcaggtgtga 

atgat 



atattatgag 

caaatactat 

tgaaaccagt 

tcattctaat 

ccgtggagaa 

gagtactttt 

ctctgacatg 

aggatctacc 

ctataaaaag 

atgattggtg 

tagggaatta 

tggaattgaa 

ttagagggaa 

tcatataatt 

ctctttgggt 

cttagtgagg 

taacaaatcc 

ttttgcttga 

caatatttgg 

tttttgttaa 

agattaattt 

aggatgacca 

tcaaacaatt 

agggggcagg 



tgatctagag 

gtcattttat 

cccccatgga 

gggggaagac 

gtaacaaatg 

cttagtttat 

tgcccatagc 

caaaaagcgt 

agctaatggt 

atgtcttgga 

gcaatggaag 

gtcatgaggc 

tttacttggg 

gcaaagcaga 

cgtaggagca 

cctgggtgaa 

ctaagtaaat 

gagagctata 

taatatttat 

attattgttt 

atacacattg 

cgtaagcaaa 

tcgaacacga 

aatcagcgag 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1475 



1 



<210> 2 

<211> 1793 

<212> DNA 

<213> Homo sapiens 



<220> 

<223> spinal muscular atrophy 5 (SMA5) 



<400> 2 

gcaaagccag ctgggctcct gagtccggtg 
gaggattgta aatgcaccaa tcagcatgct 
tattttgggc cagtggctgt cattacatat 
ctcaagctcc atgaggagat gtttcattgt 
acaatgcaac agagtgaaag agcaggttct 
agaaacttca catcaggtga tggataggag 
ttcaaacagg ctggtgaatt accagatctc 
agtgtgtctt ttgaatgcag agaacaaagt 
gctgaaggtg ctgggtgcca acctctggtg 
cctgtactcc tgggaggatg gtgattgctc 
tgacctttgg gaccaactcc acctacgcag 
tccgtgtgaa cagctactac tcttggtatc 
tgcagctggc cgcccagttt gagaattggt 
gcgtatggag tggaaacgct tgtagggttt 
tcaagtgcag taaccagttc aagttggaag 
tggacaacca ggctgggacc cagggccagc 
gtacctgatg cacgaacacc ccgcctacct 
ccaaagcctt ggacccctcc cagcctgtga 
acaagggggc tctgtatgtg gatgtgatcc 
actacgggca cctggagttg attcggctgc 
attggtgtaa gacatcacaa tcccattatt 
gggtttcacc agtctttccc agggaactcc 
aaccaagaag aggatgacat tagatccagg 
gcctgtgaag aaagatccct ggatcccagg 
gcaggccaga agacttccag ggaaactcat 
ctcacgcctg taataccagc actttgggag 
agtttgaaac tagcctggcc aacgtggcaa 
agccaggcat agtggtgcat gcctgtagtc 
attgcttgaa cctgggaggc agagtctgcg 
ctgggtgaca gagccagact ccgtctctac 



ggtacttgga gaacttacta cgtctagctg 60 
gtgtctagct caagattttc tccatcccct 120 
gagatgagtc tcttgaagac tacagatgaa 180 
cgagagcagt catgatggcc tgcactccac 240 
gcttctttgg tgtagtcctg aagcttccta 300 
caaccctgta aaaccagcct tagactattt 360 
cgtcaagtgc agtaaccagt tcaagttgga 42 0 
cgtggacaac caggctggga cccagggcca 4 80 
gccgtacctg atgcacgaac accccgccta 540 
acaccaaagc cttggacccc tcccagcctg 600 
cagacaaggg ggctctgtat gtggatgtga 660 
gcaactacgg gcacctggag ttgattcggc 720 
gtgagacatc acaatcccat tattcagagc 780 
caccagggct ggtgaattac cagatctccg 84 0 
tatgtctttt gaatgcagaa aacaaagtcg 900 
tgaaggtgct ggtgccaacc tctggtggcc 960 
gtactcgtgg gaggatggtg attgctcaca 1020 
cctttgggac caactccacc tacgcagcag 1080 
gtgtgaacag ctactactct tggtatcgca 1140 
aggccctgca gctggccgcc cagtttgtga 1200 
cagagcgcgt atggagtgga aacgcttgta 1260 
gatgaagtgt tccaacaaaa tgagcgagtg 13 2 0 
agatacaaca gaggagataa tctccaggat 13 80 
atgattatag gacaagttgt tcataatcca 1440 
ttcaagatga aaatggacca gccgcagtgg 1500 
gctgaggcgg gcggatcact tgaggtcaag 1560 
aactccatct ctattaaaga tacaaaaatt 1620 
ccagctactt gggatgctga ggcaggaaga 1680 
gtgaccgaga tcatgccact gcactccagc 1740 
taaaaaaaaa aaaaaaaaaa aaa 17 93 



<210> 3 
<211> 140 
<212> PRT 

<213> Homo sapiens 



<220> 

<22 3> spinal muscular atrophy 5 (SMA5) 



<400> 3 

Met Asp Arg Ser Asn Pro Val Lys 
1 5 

Arg Leu Val Asn Tyr Gin lie Ser 
20 

Leu Glu Val Cys Leu Leu Asn Ala 
35 40 

Ala Gly Thr Gin Gly Gin Leu Lys 
50 55 



Pro Ala Leu Asp Tyr Phe Ser Asn 
10 15 

Val Lys Cys Ser Asn Gin Phe Lys 
25 30 

Glu Asn Lys Val Val Asp Asn Gin 
45 

Val Leu Gly Ala Asn Leu Trp Trp 
60 



2 



Pro Tyr Leu Met His Glu His Pro 
65 70 

Gly Asp Cys Ser His Gin Ser Leu 
85 

Trp Asp Gin Leu His Leu Arg Ser 
100 

Cys Asp Pro Cys Glu Gin Leu Leu 
115 120 

Pro Gly Val Asp Ser Ala Ala Ala 
130 135 



Ala Tyr Leu Tyr Ser Trp Glu Asp 
75 80 

Gly Pro Leu Pro Ala Cys Asp Leu 
90 95 

Arg Gin Gly Gly Ser Val Cys Gly 
105 110 

Leu Leu Val Ser Gin Leu Arg Ala 
125 

Gly Arg Pro Val 
140 



<210> 4 

<211> 1372 

<212> DNA 

<213> Homo sapiens 

<220> 

<223> SMA3 

<400> 4 

ggcacgaggc tccgcccgcg 
cggtgggtac ttggagaact 
atgctgtgtc tagctcaaga 
catatgagaa ctcaagctcc 
tgcactccac acaatgcaac 
aagcttccta agaaacttca 
tagactattt ttcaaacagg 
tcaagttgga agtgtgtctt 
cccagggcca gctgaaggtg 
accccgccta cctgtactcg 
tcccagcctg tgacctttgt 
gtggatgtga tccgtgtgaa 
ttgattcagc tgcagctggc 
tattcagagc gcgtatggag 
ctccgatgaa gtgttccaac 
caggagatac aacagaggag 
caggatgatt ataggacaag 
tcattcaagg aggtgaaaat 
tggctcacgc ctgtaatacc 
aggagtttga aactagcctg 
attagccaag catagtggtg 
agaattgctt gaacctggga 
cagcctgggc aacacagtga 



gccgggatgc actaggcaaa 
tactacgtct agctggagga 
ttttctccat ccccttattt 
atgaggagat gtttcattgt 
agagtgaaag agcaggttct 
catcaggtga tggataggag 
ctggtgaatt accagatctc 
ttgaatgcag aaaacaaagt 
ctgggtgcca acctctggtg 
tgggaggatg gtgattgctc 
gaccaactcc acctacgcag 
cagctactac tcttggtatc 
cgcccagttt gagaattggt 
tggaaacgct tgtagggttt 
aaaatgagcg agtgaaccaa 
ataatctcca ggatgcctgt 
ttgttcataa tccagcaggc 
gatggatgac tcctccaaga 
agcactttgg gaggctgagg 
gccaacgtgg caaaactcca 
catgcctgta gtcccagcta 
ggcagagtct acagtgagcc 
gactccatct caaaaaaaaa 



gccagctggg ctcctgagtc 60 
ttgtaaatgc accaatcagc 12 0 
tgggccagtg gctgtcatta 180 
cgagagcagt catgatggcc 24 0 
gcttctttgg tgtagtcctg 300 
caaccctgta aaaccagcct 360 
cgtcaagtgc agtaaccagt 420 
cgtggacaac caggctggga 4 80 
gccgtacctg atgcacgaac 540 
acaccaaagc cttggacccc 600 
cagacaaggg ggctctgtat 660 
gcaactacgg gcacctggag 72 0 
gtaagacatc acaatcccat 780 
caccagtctt tcccagggaa 84 0 
gaagaggatg acattagatc 900 
gaagaaagat ccctggatcc 960 
cagaagactt ccagggaaac 1020 
tgaaaatgga ccagccgcag 1080 
caggcggatc acttgaggtc 114 0 
tctctattaa aaatacaaaa 1200 
cttgggatgc tgaggcagga 12 60 
gagatcatgc cactgcactc 1320 
aaaaaaaaaa aa 1372 



<210> 5 
<211> 140 
<212> PRT 

<213> Homo sapiens 

<220> 

<223> SMA3 
<400> 5 

Met Asp Arg Ser Asn Pro Val Lys Pro Ala Leu Asp Tyr Phe Ser Asn 
15 10 15 



3 



Arg Leu Val Asn Tyr Gin lie Ser Val Lys Cys Ser Asn Gin Phe Lys 
20 25 30 



Leu Glu Val Cys Leu Leu Asn Ala Glu Asn Lys Val Val Asp Asn Gin 
35 40 45 

Ala Gly Thr Gin Gly Gin Leu Lys Val Leu Gly Ala Asn Leu Trp Trp 
50 . 55 60 

Pro Tyr Leu Met His Glu His Pro Ala Tyr Leu Tyr Ser Trp Glu Asp 
65 70 75 80 

Gly Asp Cys Ser His Gin Ser Leu Gly Pro Leu Pro Ala Cys Asp Leu 
85 90 95 

Cys Asp Gin Leu His Leu Arg Ser Arg Gin Gly Gly Ser Val Cys Gly 
100 105 110 

Cys Asp Pro Cys Glu Gin Leu Leu Leu Leu Val Ser Gin Leu Arg Ala 
115 120 125 

Pro Gly Val Asp Ser Ala Ala Ala Gly Arg Pro Val 
130 135 140 



<210> 6 

<211> 873 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> modif ied_base 

<222> (1) . . (873) 

<223> n = g, a, c or t 



<400> 6 

acctgagatc 

aaatacaaaa 

ctgagacagg 

attgcattcc 

gcataatttg 

gatggtgtga 

caagtgtgaa 

ttctttgaaa 

acatggacat 

ttagcattgt 

tattagtatg 

cagatttttg 

acatgtatac 

gtgagtctag 

ataggttaat 



aggagttcga 
aattagctgg 
agaatcagtt 
aagcctgggc 
tggatgagga 
ttgcctggct 
agagagataa 
aagcaattgt 
cttagtggac 
gattaacacc 
ccatcaaatt 
aggatcttct 
aagggcactg 
cacagcattg 
tggtaaattt 



gatcagcctg 
acacgatggt 
gacctgggag 
aacaagagtg 
ttggatataa 
agaagaagaa 
atctgtgaag 
agcaaaaaaa 
tagaagttaa 
ttctaaaatt 
tagtagctgt 
ccctcttaaa 
tatacatcag 
cccaatagaa 
tctnatagnc 



accaataggg 
gggtgcctgt 
ttggtggttg 
aaactccatc 
ggtaaaggat 
ttcccggtca 
attataggga 
aagaaaattt 
gggcataaat 
gccagaactt 
ttcaggcttt 
agagtattca 
tctaaaaaat 
ataccaatgg 
acc 



tgaaaccccg 
ggtcccggct 
cagtgagctg 
gcaaaaaaaa 
gggacattct 
aaaagaaacc 
ctacaggaaa 
cttactgtca 
tctcccagtg 
aataaataat 
aatgtgtcaa 
gttaactgcc 
aaaaatatgt 
aggtcacaaa 



tctctactaa 
actcgggagg 
agatcacacc 
aaaagaaggg 
tggacttaca 
atcagctttc 
cttaatcttt 
tctaaaattg 
atttttaatt 
tgcttttcat 
gcctaaaatc 
gtagaaatac 
atacgttctg 
tgtggcccat 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

873 



<210> 7 
<211> 1505 
<212> DNA 

<213> Homo sapiens 
<400> 7 

gtaaaggaat gtctttttaa ttcagctttt cttttctcca tgctagtgtt atcaggtttt 60 
ggtatttatt tacttacagc atatgttatg aagctggttt gaaaattggt tttagatata 120 



4 



tctgcaagtt tactactttg actgtaaaaa 
ctcagaagaa gtttgcaggt tgcatatttg 
atgttccttg ggaattgaaa tggtcagtgg 
agcaccagtt cattcttttg caccttaggg 
caaccagttg cccacagact gttatttctt 
tatattctat ttttagtgta cagtgctttg 
tttgaaaaat gttataagac ttggacatta 
aagtaaaaca aataaattct atgcttcttc 
atataaatgg actatgtaaa gtgacatggt 
cctgttccaa caagataacc aacaggtata 
attggctgcc ttgaaatgct atatgtttta 
aggaaattac agttacatcc tggctaacat 
aaaattagcc gggcgtggtg gcgggcactt 
ggagaatggc gtgaacccag gaggcggagg 
tctggcctgg gagacagagc gagactccat 
gagagagaga cctggagtag agattctgtc 
gaaatggaat ctgttgtctc ttcgaaatat 
agtatgcctt cgtgtgggct actcgttgtg 
gggtcaagaa tatgcaactg gctgggcaca 
ggaggctgag gcaggcggat cacctgaggt 
gtgaaacccc gtctctactg aaaatacaaa 
aatcccagct actcgggagg ctgacgtgag 
gcagtgagcc gagattgcac cattgcactc 
ctcag 



aaaaaaatga aaaagtagtt gacatctgtc 180 
tgtgtaaata cacaggctaa aaggtaattt 240 
cccgttacag aaacttatca gtcatatatc 300 
accatctgtc ccctgaggtg acctgagaaa 3 60 
caagtgagcc aggatttgat ttcactgcct 420 
attttttgga aaaactaaat tttaaacata 480 
agtctgttga tagccaaagt cagtttacca 540 
attgtcaaag agcagtctgc catcatgtgg 600 
gcttactctc tacctaataa tagcctccct 660 
tttaatttac cagttaatat gttttggata 720 
tagtacatca tagctttagt tttcttcata 780 
ggtgaaactc catctctact aaaaatacaa 840 
gtagtcccag ctactcggga ggctgaggca 900 
ttgcagtgag ccgagatcgt gccactgtac 960 
ctcaaaaaaa aaaaaaaaaa aaaaaaaaga 102 0 
aaagaacttt ttctttcttg agaagcatct 1080 
gtactgctgt aacagtgaaa caaccctcag 1140 
gttttgaact tgggggaact gtctgtgttt 1200 
ttggctcacg cctgtaatcc cagcaatttg 1260 
cagggcttca agaccagact ggccaacatg 1320 
aattagctgg gcatggtggc aggtgcctgt 13 8 0 
agaatcgctt gaacccggga gttggaggtt 144 0 
cagcttgggc aacaagagtg aaactcttgt 1500 

1505 



<210> 8 

<211> 730 

<212> DNA 

<213> Homo sapiens 

.<400> 8 

gactaggctg ggcaacatag tgagacctca 
gaaaaaaacc taaaaacatg ccaagtgaca 
gtgatgccat gggtgggggt gggaaatgct 
acaccatgtg gctgggcacg gtggctcaca 
gcaggtggat cccttgagct taggaatttg 
catctcaaaa aaaaaaaaaa aaaaaggtgc 
cccattcttt tcctttttcc agaggtgatt 
agacactttt ctaggcatct atgcgcctct 
tgtgcttccc ttaaatggca tatgtatctt 
aataccttaa aagctcaacc tgggcttggt 
ttgggaggcc aaggtgggag gatcacttga 
aaacaaaagg attttgagat cagtgtgggc 
aaaaaaaaaa 



tctctaaaat taaaaaaata aaagccacca 60 
tcagtctttg atgaaaatgg cagcagaaga 12 0 
atttcagcag agagggagct gtcatggaag 180 
cctgtaatcc cagcactttg ggagatagag 240 
agactagcct gggcaataag agtgaaactc 300 
atgaaacata tgaagcaaaa agtgaaagtc 360 
tttgtggcca atctggtttc attccctccc 420 
attcacatat aaacaaaata ggagttttcc 480 
tcactctttt ttttcaccta gtggatcttt 540 
gcggtggctc atacgtgtaa tcccaggcct 600 
gctcaggagt tccagaccat tccaaagcaa 660 
aacttagcaa aacaccatct cttaaaaaaa 720 

730 



<210> 9 

<211> 1970 

<212> DNA 

<213> Homo sapiens 

<220> 

<223> DKFZp434A0131 

<400> 9 

ggtcgccctc cgtcgtggtc tggcgtgtat 
cgttggtgtc tggcggtttc cgagcgttgg 
gcggtttccg accgttggtg tctggcacgc 
ccgatgtacc agacaagaag acaagaaaat 
agaagaaagc agcaaaacga ccactgaaaa 



tccgagcctt ggtgtctggc ggtttccgag 60 
tgtctggcgg tttccgaccg ttggtgtctg 12 0 
gccaccctct cttgctttgg ttgcgccatg 180 
gatttgagga cagcttcaat cgcggtgtga 240 
caacgccggt ggcaaaatat ccaaagaaag 3 00 



5 



ggtcccaagc ggtacatcgt catagccgga 
tcaatgctgc gaaagctgcc aaaagtgaca 
tgaaggctct gaaagggctg tacggtaacc 
ctggccgctt caaggactgg atggtttcca 
tggaggccgt cagattactg atacttatcc 
tggactgtga gagcgtctac cccattgtgt 
tgggtgaatt tctgtactgg aaacttttct 
gaagagagca acgccagagc ccaggtgccc 
tctttgtgga gagcaagctc cacgaccacg 
gtgcagggac tcagctgaag gactgggagg 
agagcacgtg ccacatggag ccagggccag 
agagattgct tgcttcactt gtacaaggca 
gagttggaag gtggctaatc tttgattcta 
gggtgatgtg caggagagca cactgataga 
gcctcagcct cccgagcagc tgggactaca 
ttgtgttttt agtagagacg ggttttcacc 
ccttgtgatc cacctgcctc atcatcccaa 
gcccagcatg ttagacaatt tttaattcat 
aaaggaatat tctggtgggg acaaggttac 
ggtactggcc ttggcacagt gcctgggggg 
cagttactgt cagttcacag cgaaccttcc 
tgtaaccctc cctccctccc ttcttcctct 
cacacacaca cacacacaca cacacactcc 
taacgtctcg agttgccatt ggaagccccg 
ggcacttttg acttcctggg gtccttcttc 
gggcgtggtg gcacatgcct gtaatcccag 
sggsQtgagc caccatgccc agcctccgtt 
cacttttgac ttcctggggt ccttcttcag 



aacagtcaga gccaccagcc aatgatcttt 360 
tgcagcaccg agaagtccgc gtgaagtgcg 420 
gggacctgac cgcacgcctg gagctcttca 480 
tgatcatgga cagagagtac agtgtggcag 54 0 
ttaagaacat ggaaggggtg ctgatggacg 60 0 
aggcctctaa ttgaggcctg gcctctgctg 660 
accctgagtg cgagataaga acgatgggtg 720 
agaggacttt cttccagctt ctgctgtcct 780 
ctgcttactt agtagacaac ctgtgggact 84 0 
gtctgacaag cctgctgctg gagaaggacc 900 
ggaccttcca cctcctaggg tgaaaccagg 960 
ggaacggtgg catggggtgg gggaaacttg 1020 
tgtttttgat cctcctggca ctccagacct 1080 
aatccttgtg tccagtgccc agcaactcct 1140 
ggcgcccgcc accacgcctg gctaactttt 1200 
gtgttggcca ggatggtctt gatctcttga 1260 
agtgctggga ttacaggcgt gagccactgc 1320 
cctctctgtg ctgttgtttt ctcagctgtg 1380 
agagttgctg agagggtctc atgacatgaa 1440 
gcggggactc cgcacatgcc tgtgatgtca 1500 
ctccttttcc tgttgacttt cccacactcc 1560 
ctctctctct cactcacgca cacgcacaca 1620 
attcactgtc tccatgactc tggagtaaac 1680 
ttgtcctcat ttagactttc atgggttata 1740 
agttaaaaaa aaaaattaga aaattaggcc 1800 
caccttggcc tcccaaagtg ctgggattac 1860 
gtcctcattt agactttcat gggttatagg 1920 
ttaaaaaaaa aaaaaaaaaa 1970 



<210> 10 

<211> 1266 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> modif ied_base 
<222> (1) - . (1266) 
<223> n = g, a, c or t 



<400> 10 

agtggntccc 

cacctttcat 

ctgagcaagc 

ctttctataa 

aaagtactca 

gtggtctctc 

aggaattgtt 

attatgggaa 

atctagactg 

tgtgtttgta 

acaataacat 

tgtgaaatta 

gcgtcaattt 

cccgtaatta 

aacaattttt 

aaagaggaac 

ttttgtaacg 

agcgtagcaa 

attgggcata 

gttcattgca 



ccggnctgca 
tcaggcatga 
tcctgtttta 
cgtcctcctt 
gggagccctg 
ttttcactaa 
ccatggctgt 
ggatacatgc 
ctttccattt 
acttactatt 
ggagtggatt 
aatgggagaa 
atttgtgaat 
cttattgcag 
ttcctcatca 
atgctaaaga 
cttatacact 
ttccttgaaa 
tacccagtgg 
gcactattca 



ggaattcggc 
tgccaggtgc 
taaatggacc 
gtcctccttc 
cattgcttct 
atgtccaata 
catatatgtt 
tcctactagt 
agtctttatg 
ataaaatatg 
tacatgtatt 
gtatctgata 
attgaaagct 
tattgtgttc 
tgaagtaagg 
aaagaatact 
gctggtggga 
aacttaaaat 
aatgtaaatc 
caatagcaaa 



acgagatcat 
accaacttac 
agttcttgtt 
cacattctta 
tgaagccttc 
tgctgcacat 
aaaaatcatt 
aaatttagta 
catagctttc 
cgtctctatg 
ctctatattt 
cataacaggc 
ccaaaaaaga 
atacaaactg 
tatgctcact 
cccatataat 
atgtaaatta 
agatttaccg 
atcctgccat 
gacatggaat 



gatggctaat 

ttaatcctca 
gctgttgtac 
aagaaacttt 
tccagcttca 
aagtacccca 
aaaagttcat 
ggtagaaaaa 
gtgtctgcct 
ttcattgtca 
ggattaaagg 
aatacaaata 
aaaaaagttt 
ctcagtcatt 
gcaaaaaaaa 
ctctgtcttc 
gttcagccat 
ttcaacccag 
aaaaacacat 
caacctatat 



atttcctgag 
tagccaccac 
aagttatttt 
cccttccttt 
tcatctcaca 
aagttagcac 
tttttctctc 
aattatcact 
atttttacct 
acgattattt 
agatagagta 
ttatcacata 
ttttttaatt 
ttggagaaat 
tctagaaaat 
ataaataatc 
tgtgaaaagt 
caatcccatt 
gcacatgtat 
gcccatcaat 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 
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agtagactga ataaagaaaa tatggtacat attcaccaca gaatactaag cagccataaa 12 60 
aaaaaa ^^^^ 



<210> 11 
<211> 1524 
<212> DNA 

<213> Homo sapiens 
<220> 

<223> secreted phosphoprotein 1 (osteopontin, bone 

sialoprotein I, early T- lymphocyte activation 1) 
(SPPl) 



<400> 11 

gcagagcaca 

aacgccgacc 

ggcatcacct 

ctttacaaca 

cagaatctcc 

gatgatatgg 

aacgactctg 

tctgatgaat 

ttcactccag 

ggactgaggt 

gacgaggaca 

cccgttgccc 

gaaacgagtc 

tataagcgga 

ctttccaaag 

gttgtagacc 

ttagatagtg 

atttagtcaa 

ctcagtttat 

attagtttag 

ctatgttcat 

tcatgaatag 

ataacatttt 

tatctttttg 

aattgcttat 

gcctaaaaaa 



gcatcgtcgg 
aaggaaaact 
gtgccatacc 
aatacccaga 
tagccccaca 
atgatgaaga 
atgatgtaga 
ctgatgaact 
ttgtccccac 
caaaatctaa 
tcacctcaca 
aggacctgaa 
agctggatga 
aagccaatga 
tcagccgtga 
ccaaaagtaa 
catcttctga 
aagaaaaaat 
tggttgaatg 
tttgtggctt 
tctatagaag 
aaatttatgt 
atgtcactat 
tggtgtgaat 
ttgttttccc 
aaaaaaaaaa 



gaccagactc 
cactaccatg 
agttaaacag 
tgctgtggcc 
gacccttcca 
tgatgatgac 
tgacactgat 
ggtcactgat 
agtagacaca 
gaagtttcgc 
catggaaagc 
cgcgccttct 
ccagagtgct 
tgagagcaat 
attccacagc 
ggaagaagat 
ggtcaattaa 
gctttatagc 
tgtatctatt 
catggaaact 
aaatgcaaac 
agaagcaaac 
aatcttttgt 
aaatctttta 
acggttgtcc 
aaaa 



gtctcaggcc 
agaattgcag 
gctgattctg 
acatggctaa 
agtaagtcca 
catgtggaca 
gattctcacc 
tttcccacgg 
tatgatggcc 
agacctgaca 
gaggagttga 
gattgggaca 
gaaacccaca 
gagcattccg 
catgaatttc 
aaacacctga 
aaggagaaaa 
aaaatgaaag 
tgagtctgga 
ccctgtaaac 
tatcactgta 
aaaatacttt 
tttttaagtt 
tcttgaatgt 
agcaattaat 



agttgcagcc 
tgatttgctt 
gaagttctga 
accctgaccc 
acgaaagcca 
gccaggactc 
agtctgatga 
acctgccagc 
gaggtgatag 
tccagtaccc 
atggtgcata 
gccgtgggaa 
gccacaagca 
atgtgattga 
acagccatga 
aatttcgtat 
aatacaattt 
agaacatgaa 
aataactaat 
taaaagcttc 
ttttaatatt 
tacccactta 
agtgtatatt 
aataagaatt 
aaaacataac 



ttctcagcca 
ttgcctccta 
ggaaaagcag 
atctcagaag 
tgaccacatg 
cattgactcg 
gtctcaccat 
aaccgaagtt 
tgtggtttat 
tgatgctaca 
caaggccatc 
ggacagttat 
gtccagatta 
tagtcaggaa 
agatatgctg 
ttctcatgaa 
ctcactttgc 
atgcttcttt 
gtgtttgata 
agggttatgt 
tgttattctc 
aaaagagaat 
ttgttgtgat 
tggtggtgtc 
cttttttact 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1524 



<210> 12 
<211> 300 

<212> PRT 

<213> Homo sapiens 
<220> 

<223> secreted phosphoprotein 1 (osteopontin, bone 

sialoprotein I, early T- lymphocyte activation 1) 
(SPPl) 



<400> 12 

Met Arg lie Ala Val lie Cys Phe Cys Leu Leu Gly lie Thr Cys Ala 
15 10 15 

lie Pro Val Lys Gin Ala Asp Ser Gly Ser Ser Glu Glu Lys Gin Leu 
20 25 30 



7 



Tyr Asn Lys Tyr Pro Asp Ala Val Ala Thr Trp Leu Asn Pro Asp Pro 
35 40 45 



Ser Gin Lys Gin Asn Leu Leu Ala Pro Gin Thr Leu Pro Ser Lys Ser 
50 55 60 

Asn Glu Ser His Asp His Met Asp Asp Met Asp Asp Glu Asp Asp Asp 
65 70 75 80 

Asp His Val Asp Ser Gin Asp Ser lie Asp Ser Asn Asp Ser Asp Asp 
85 90 95 

Val Asp Asp Thr Asp Asp Ser His Gin Ser Asp Glu Ser His His Ser 
100 105 110 

Asp Glu Ser Asp Glu Leu Val Thr Asp Phe Pro Thr Asp Leu Pro Ala 
115 120 125 

Thr Glu Val Phe Thr Pro Val Val Pro Thr Val Asp Thr Tyr Asp Gly 
130 135 140 

Arg Gly Asp Ser Val Val Tyr Gly Leu Arg Ser Lys Ser Lys Lys Phe 
145 150 155 160 

Arg Arg Pro Asp lie Gin Tyr Pro Asp Ala Thr Asp Glu Asp lie Thr 
165 170 175 

Ser His Met Glu Ser Glu Glu Leu Asn Gly Ala Tyr Lys Ala lie Pro 
180 185 190 

Val Ala Gin Asp Leu Asn Ala Pro Ser Asp Trp Asp Ser Arg Gly Lys 
195 200 205 

Asp Ser Tyr Glu Thr Ser^ Gin Leu Asp Asp Gin Ser Ala Glu Thr His 
210 215 220 

Ser His Lys Gin Ser Arg Leu Tyr Lys Arg Lys Ala Asn Asp Glu Ser 
225 230 235 240 

Asn Glu His Ser Asp Val lie Asp Ser Gin Glu Leu Ser Lys Val Ser 
245 250 255 

Arg Glu Phe His Ser His Glu Phe His Ser His Glu Asp Met Leu Val 
260 265 270 

Val Asp Pro Lys Ser Lys Glu Glu Asp Lys His Leu Lys Phe Arg lie 
275 280 285 

Ser His Glu Leu Asp Ser Ala Ser Ser Glu Val Asn 
290 295 300 



<210> 13 

<211> 3205 

<212> DNA 

<213> Homo sapiens 

<220> 

<223> cadherin 3, type 1, P-cadherin (placental) {CDH3) 
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<400> 13 

aaaggggcaa gagctgagcg gaacaccggc 
ctctgcagcc atggggctcc ctcgtggacc 
ctggctgcag tgcgcggcct ccgagccgtg 
cttggaggcg ggaggcgcgg agcaggagcc 
ctgccctggg caagagccag ctctgtttag 
tggcgagaca gtccaggaaa gaaggtcact 
atccaaacgt atcttacgaa gacacaagag 
tgaaaatggc aagggtccct tcccccagag 
agacaccaag attttctaca gcatcacggg 
cttcgctgta gagaaggaga caggctggtt 
gattgccaag tatgagctct ttggccacgc 
ccccatgaac atctccatca tcgtgaccga 
ggacaccttc cgagggagtg tcttagaggg 
gacagccacg gatgaggatg atgccatcta 
ccatagccaa gaaccaaagg acccacacga 
caccatcagc gtcatctcca gtggcctgga 
catccaggcc acagacatgg atggggacgg 
gatccttgat gccaatgaca atgctcccat 
gcctgagaat gcagtgggcc atgaggtgca 
caactcacca gcgtggcgtg ccacctacct 
taccatcacc acccaccctg agagcaacca 
ttttgaggcc aaaaaccagc acaccctgta 
gctgaagctc ccaacctcca cagccaccat 
acctgtgttt gtcccaccct ccaaagtcgt 
gcctgtgtgt gtctacactg cagaagaccc 
catcctgaga gacccagcag ggtggctagc 
tgtgggcacc ctcgaccgtg aggatgagca 
ggtcttggcc atggacaatg gaagccctcc 
actgattgat gtcaatgacc atggcccagt 
ccaaagccct gtgcgccagg tgctgaacat 
ccctttccag gcccagctca cagatgactc 
ggaaggtgac acagtggtct tgtccctgaa 
gcacctttct ctgtctgacc atggcaacaa 
gtgcgactgc catggccatg tcgaaacctg 
ccctgtgctg ggggctgtcc tggctctgct 
gagaaagaag cggaagatca aggagcccct 
cgtcttctac tatggcgaag aggggggtgg 
gctccaccga ggtctggagg ccaggccgga 
catcatcccg acacccatgt accgtcctcg 
tataattgag aacctgaagg cggctaacac 
cttggtgttc gactatgagg gcagcggctc 
ctccgcctcc gaccaagacc aagattacga 
gaagctggca gacatgtacg gtggcgggga 
gggaccaaac gtcaggccac agagcatctc 
gacttcggag cttgtcagga agtggccgta 
acgttagagt ggttgcttcc ttagcctttc 
agcactgaaa acctctccac ctgggccagg 
tcttacctgc cgtaaaatgc tcaaccctgt 
tacagtggac tttctctctg gaatggaacc 
tttttttaat gctatcttca aaacgttaga 
gctgggccca ctggccgtcc tgcatttctg 
tggatctctg cgtttttata ctgagtgtgc 
gttgcgttgc tatagatgaa gggtgaggac 
taaagaaact tttcccagaa aaaaa 



ccgccgtcgc ggcagctgct tcacccctct 60 
tctcgcgtct ctcctccttc tccaggtttg 120 
ccgggcggtc ttcagggagg ctgaagtgac 180 
cggccaggcg ctggggaaag tattcatggg 24 0 
cactgataat gatgacttca ctgtgcggaa 3 00 
gaaggaaagg aatccattga agatcttccc 360 
agattgggtg gttgctccaa tatctgtccc 420 
actgaatcag ctcaagtcta ataaagatag 480 
gccgggggca gacagccccc ctgagggtgt 54 0 
gttgttgaat aagccactgg accgggagga 600 
tgtgtcagag aatggtgcct cagtggagga 66 0 
ccagaatgac cacaagccca agtttaccca 720 
agtcctacca ggtacttctg tgatgcaggt 780 
cacctacaat ggggtggttg cttactccat 84 0 
cctcatgttc accattcacc ggagcacagg 900 
ccgggaaaaa gtccctgagt acacactgac 960 
ctccaccacc acggcagtgg cagtagtgga 1020 
gtttgacccc cagaagtacg aggcccatgt 1080 
gaggctgacg gtcactgatc tggacgcccc 114 0 
tatcatgggc ggtgacgacg gggaccattt 1200 
gggcatcctg acaaccagga agggtttgga 1260 
cgttgaagtg accaacgagg ccccttttgt 1320 
agtggtccac gtggaggatg tgaatgaggc 13 80 
tgaggtccag gagggcatcc ccactgggga 144 0 
tgacaaggag aatcaaaaga tcagctaccg 1500 
catggaccca gacagtgggc aggtcacagc 1560 
gtttgtgagg aacaacatct atgaagtcat 162 0 
caccactggc acgggaaccc ttctgctaac 1680 
ccctgagccc cgtcagatca ccatctgcaa 1740 
cacggacaag gacctgtctc cccacacctc 1800 
agacatctac tggacggcag aggtcaacga 1860 
gaagttcctg aagcaggata catatgacgt 1920 
agagcagctg acggtgatca gggccactgt 1980 
ccctggaccc tggaagggag gtttcatcct 204 0 
gttcctcctg ctggtgctgc ttttgttggt 2100 
cctactccca gaagatgaca cccgtgacaa 2160 
cgaagaggac caggactatg acatcaccca 2220 
ggtggttctc cgcaatgacg tggcaccaac 2280 
gccagccaac ccagatgaaa tcggcaactt 2340 
agaccccaca gccccgccct acgacaccct 2400 
cgacgccgcg tccctgagct ccctcacctc 2460 
ttatctgaac gagtggggca gccgcttcaa 2520 
ggacgactag gcggcctgcc tgcagggctg 2 5 80 
caaggggtct cagttccccc ttcagctgag 2640 
gcaacttggc ggagacaggc tatgagtctg 2 7 00 
aggatggagg aatgtgggca gtttgacttc 2760 
gttgcctcag aggccaagtt tccagaagcc 2820 
gtcctgggcc tgggcctgct gtgactgacc 2880 
ttcttaggcc tcctggtgca acttaatttt 2940 
gaaagttctt caaaagtgca gcccagagct 3000 
gtttccagac cccaatgcct cccattcgga 3060 
ctaggttgcc ccttattttt tattttccct 3120 
aatcgtgtat atgtactaga acttttttat 3180 

3205 



<210> 14 
<211> 829 
<212> PRT 

<213> Homo sapiens 
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<220> 

<223> cadherin 3, type 1, P-cadherin (placental) (CDH3) 



<400> 14 
Met Gly Leu 
1 



Pro Arg Gly Pro Leu Ala Ser Leu Leu Leu Leu Gin Val 




Cys Trp Leu Gin Cys Ala Ala Ser Glu Pro Cys Arg Ala Val Phe Arg 
20 25 30 

Glu Ala Glu Val Thr Leu Glu Ala Gly Gly Ala Glu Gin Glu Pro Gly 
35 40 45 

Gin Ala Leu Gly Lys Val Phe Met Gly Cys Pro Gly Gin Glu Pro Ala 
50 55 60 

Leu Phe Ser Thr Asp Asn Asp Asp Phe Thr Val Arg Asn Gly Glu Thr 
65 70 75 80 

Val Gin Glu Arg Arg Ser Leu Lys Glu Arg Asn Pro Leu Lys lie Phe 
85 90 95 

Pro Ser Lys Arg He Leu Arg Arg His Lys Arg Asp Trp Val Val Ala 
100 105 110 

Pro He Ser Val Pro Glu Asn Gly Lys Gly Pro Phe Pro Gin Arg Leu 
lis 120 125 

Asn Gin Leu Lys Ser Asn Lys Asp Arg Asp Thr Lys He Phe Tyr Ser 
130 135 140 

He Thr Gly Pro Gly Ala Asp Ser Pro Pro Glu Gly Val Phe Ala Val 
145 150 155 160 

Glu Lys Glu Thr Gly Trp Leu Leu Leu Asn Lys Pro Leu Asp Arg Glu 
165 170 175 

Glu He Ala Lys Tyr Glu Leu Phe Gly His Ala Val Ser Glu Asn Gly 
180 185 190 

Ala Ser Val Glu Asp Pro Met Asn He Ser He He Val Thr Asp Gin 
195 200 205 

Asn Asp His Lys Pro Lys Phe Thr Gin Asp Thr Phe Arg Gly Ser Val 
210 215 220 

Leu Glu Gly Val Leu Pro Gly Thr Ser Val Met Gin Val Thr Ala Thr 
225 230 235 240 

Asp Glu Asp Asp Ala He Tyr Thr Tyr Asn Gly Val Val Ala Tyr Ser 
245 250 255 

He His Ser Gin Glu Pro Lys Asp Pro His Asp Leu Met Phe Thr He 
260 265 . 270 

His Arg Ser Thr Gly Thr He Ser Val He Ser Ser Gly Leu Asp Arg 
275 280 285 

Glu Lys Val Pro Glu Tyr Thr Leu Thr He Gin Ala Thr Asp Met Asp 
290 295 300 
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Gly Asp Gly Ser 
305 

Ala Asn Asp Asn 



Val Pro Glu Asn 
340 

Asp Leu Asp Ala 
355 

Met Gly Gly Asp 
370 

Ser Asn Gin Gly 
385 

Lys Asn Gin His 



Val Leu Lys Leu 
420 

Asp Val Asn Glu 
435 

Val Gin Glu Gly 
450 

Glu Asp Pro Asp 
465 

Asp Pro Ala Gly 



Ala Val Gly Thr 
500 

lie Tyr Glu Val 
515 

Thr Gly Thr Gly 
530 

Gly Pro Val Pro 
545 

Val Arg Gin Val 



Ser Pro Phe Gin 
580 

Ala Glu Val Asn 
595 

Phe Leu Lys Gin 
610 



Thr Thr Thr Ala 
310 

Ala Pro Met Phe 
325 

Ala Val Gly His 



Pro Asn Ser Pro 
360 

Asp Gly Asp His 
375 

lie Leu Thr Thr 
390 

Thr Leu Tyr Val 
405 

Pro Thr Ser Thr 



Ala Pro Val Phe 
440 

lie Pro Thr Gly 
455 

Lys Glu Asn Gin 
470 

Trp Leu Ala Met 
485 

Leu Asp Arg Glu 



Met Val Leu Ala 
520 

Thr Leu Leu Leu 
535 

Glu Pro Arg Gin 
550 

Leu Asn lie Thr 
565 

Ala Gin Leu Thr 



Glu Glu Gly Asp 
600 

Asp Thr Tyr Asp 
615 



Val Ala Val Val 
315 

Asp Pro Gin Lys 
330 

Glu Val Gin Arg 
345 

Ala Trp Arg Ala 



Phe Thr lie Thr 
380 

Arg Lys Gly Leu 
395 

Glu Val Thr Asn 
410 

Ala Thr He Val 
425 

Val Pro Pro Ser 



Glu Pro Val Cys 
460 

Lys He Ser Tyr 
475 

Asp Pro Asp Ser 
490 

Asp Glu Gin Phe 
505 

Met Asp Asn Gly 



Thr Leu He Asp 
540 

He Thr He Cys 
555 

Asp Lys Asp Leu 
570 

Asp Asp Ser Asp 
585 

Thr Val Val Leu 



Val His Leu Ser 
620 



Glu He Leu Asp 
320 

Tyr Glu Ala His 
335 

Leu Thr Val Thr 
350 

Thr Tyr Leu He 
365 

Thr His Pro Glu 



Asp Phe Glu Ala 
400 

Glu Ala Pro Phe 
415 

Val His Val Glu 
430 

Lys Val Val Glu 
445 

Val Tyr Thr Ala 



Arg He Leu Arg 
480 

Gly Gin Val Thr 
495 

Val Arg Asn Asn 
510 

Ser Pro Pro Thr 
525 

Val Asn Asp His 



Asn Gin Ser Pro 
560 

Ser Pro His Thr 
575 

He Tyr Trp Thr 
590 

Ser Leu Lys Lys 
605 

Leu Ser Asp His 
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Gly Asn Lys Glu Gin Leu Thr Val He Arg Ala Thr Val Cys Asp Cys 
625 630 635 640 



His Gly His Val Glu Thr Cys Pro Gly Pro Trp Lys Gly Gly Phe He 
645 650 655 

Leu Pro Val Leu Gly Ala Val Leu Ala Leu Leu Phe Leu Leu Leu Val 
660 665 670 

Leu Leu Leu Leu Val Arg Lys Lys Arg Lys lie Lys Glu Pro Leu Leu 
675 680 685 

Leu Pro Glu Asp Asp Thr Arg Asp Asn Val Phe Tyr Tyr Gly Glu Glu 
690 695 700 

Gly Gly Gly Glu Glu Asp Gin Asp Tyr Asp He Thr Gin Leu His Arg 
705 710 715 720 

Gly Leu Glu Ala Arg Pro Glu Val Val Leu Arg Asn Asp Val Ala Pro 
725 730 735 

Thr He He Pro Thr Pro Met Tyr Arg Pro Arg Pro Ala Asn Pro Asp 
740 745 750 

Glu He Gly Asn Phe He He Glu Asn Leu Lys Ala Ala Asn Thr Asp 
755 760 765 

Pro Thr Ala Pro Pro Tyr Asp Thr Leu Leu Val Phe Asp Tyr Glu Gly 
770 775 780 

Ser Gly Ser Asp Ala Ala Ser Leu Ser Ser Leu Thr Ser Ser Ala Ser 
785 790 795 800 

Asp Gin Asp Gin Asp Tyr Asp Tyr Leu Asn Glu Trp Gly Ser Arg Phe 
805 810 815 

Lys Lys Leu Ala Asp Met Tyr Gly Gly Gly Glu Asp Asp 
820 825 



<210> 15 
<211> 7080 
<212> DNA 

<213> Homo sapiens 
<220> 

<223> KIAA1199 



<400> 15 

gagctagcgc 

cggcgcgggg 

agctaccact 

acgtccgggg 

agagggagca 

tgctgaccat 

ctgggtgccc 

accatgtgca 

ccatccacat 

tgcgaacccg 

gccctttcca 

cggatcctta 



tcaagcagag 
agccagcggg 
ccgcttgccc 
ccgctgcgct 
cactgccagg 
cagctggctc 
tgaccagagc 
tatcggccag 
ctcagaggga 
gcacatcctg 
gggcaatttc 
ctatggtctg 



cccagcgcgg 
gctgagcgcg 
acgccccggg 
cctggcccgc 
atgggagctg 
actctgacct 
cctgagttgc 
ggcaagacac 
ggcaagctgg 
attgacaacg 
accatcattt 
aagtacattg 



tgctatcgga 
gccagggtct 
agctcgcggc 
gaggcgtgac 
ctgggaggca 
gcttccctgg 
aaccctggaa 
tgctgctcac 
tcattaaaga 
gaggagagct 
tgtatggaag 
gggttggtaa 



cagagcctgg 
gaacccagat 
gcctggcggt 
actgtctcgg 
ggacttcctc 
ggccacatcc 
ccctggccat 
ctcttctgcc 
ccacgacgag 
gcatgctggg 
ggctgatgaa 
aggaggcgct 



cgagcgcaag 
ttcccagact 
cagcgaccag 
ctacagaccc 
ttcaaggcca 
acagtggctg 
gaccaagacc 
acggtctatt 
ccgattgttt 
agtgccctct 
ggtattcagc 
cttgagttgc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 
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atggacagaa 
cagaaggagg 
tcgaccccaa 
aagagagtga 
ttgcagtgaa 
aattgggaag 
tgaaaggaaa 
ctgctgctgc 
ctttgtccag 
tatctcagac 
tatgcaatcg 
aggttgtcta 
gccggagcta 
ccattgacac 
aacctggaga 
tccaggtgct 
tgtacctgca 
tgagccggaa 
acatctgcaa 
ttaaggcagc 
agtacccgat 
ccacatacat 
gctccaatgg 
tcacggaaga 
agtctggaac 
actcctaccc 
ggatggccaa 
gattttggtt 
gttattcaga 
gggctggcat 
ggccgttcct 
agccccggga 
cctggctgcg 
gcctgaccct 
agaacagctt 
tctggggccc 
caattagagg 
agtttgtggc 
ggcagagctg 
ccagagtgtt 
agacatctgt 
cgaagaatga 
gggccatttg 
tgcgaatgaa 
tcaccaggag 
ccatccactg 
agggcgactg 
cggatgttca 
tgcagatgga 
actcagggct 
gctccatgaa 
tcagtgactg 
cgatgcccaa 
agatggagag 
tggatgggaa 
accaagggcg 
ggcagctttt 
agggaagata 
ggggtctcaa 
tctgggtgac 
ctgtggtgaa 



aaagctctcc 
ctattttttt 
atcaggcaca 
acgtctggtc 
tgatgaaggt 
caaacacttc 
tccatcatct 
ccgggtattc 
tgagtgggtt 
taaaggtggg 
tcccattgat 
caaaaaaggc 
ccgtgtacgg 
caatgtgaac 
taccctggtc 
tccctgcaga 
catcggggag 
catcatagtg 
tttctttgac 
acacttggag 
tcacttccac 
cagggacctc 
cttgttgatc 
tgggccggag 
cctcctcccc 
ggggtacatc 
tcccaacaac 
tatttttcac 
gcacattcca 
gatcatagac 
ctcaatcatc 
gccggccatc 
cggcggggat 
ggccagtggt 
gtttgttggc 
tggcggcttg 
aattcagtta 
cctggagggc 
cccccataac 
cttcggagag 
gttccatgac 
caactggctg 
cagtgggtgc 
gatcatcaag 
cacccattac 
ggaccagacg 
gatccgagtg 
caatcgcctg 
caaagtggag 
gttgttcctg 
aggctgtgag 
cacagccaca 
gaagctcttt 
ttccaagcag 
gaagtacccc 
cgtggtgagc 
caactatgtg 
cgtctccaga 
gttgaaagag 
actggacact 
gaagaagaag 



tggacatttc 
gaaaggagct 
gtcatccatt 
cagtatttga 
tctcgaaatc 
ctgcaccttg 
tcagtggaag 
aaattgttcc 
caagacgtgg 
gagaaaattt 
atacaggcca 
caggattata 
ttcctctgtg 
agcaccattc 
attgccagta 
tcctgcgccc 
gagatagacg 
atgggggaga 
ttcgatacct 
ggcacggagc 
ctggccggtg 
tccatccatc 
aaggacgttg 
gaacgcaaca 
tcggaccgtg 
cccaagccca 
aacctcatca 
cacgtaccaa 
ctgggaaaat 
aacggagtca 
tctgccagat 
atcagacact 
gtgtggctgg 
ggaaccttcc 
gagagtggca 
gaccatagcg 
tatgatggcc 
cggcacacca 
aacgtgaccg 
cctgggccct 
gtcgacggct 
gtccggcacc 
tatgcacaga 
aatgacttcc 
cagcaatacc 
gcccccgccg 
gggctctgct 
ctgaagcaaa 
cagagctacc 
aagctgaaag 
aggataaaga 
gcttacccca 
ggttctcagc 
cacttcttcc 
agttcggagg 
cacacgagct 
gcgaccatcc 
ggcccatgga 
caaatggcat 
gaggatcaca 
ttgtgaggac 



tgaacaagac 
ggggccaccg 
ctgaccggtt 
acgcggtgcc 
tggatgacat 
gatttagaca 
accatattga 
agacagagca 
agtggacgga 
cagacctctg 
ctacaatgga 
ggtttgcttg 
ggaagcctgt 
tgaacttgga 
ctgattactc 
ccaaccaggt 
gcgtggacat 
tggaggacaa 
ttgggggcca 
tgaagcatat 
atgtagacga 
atacattctc 
tgggctataa 
cttttgacca 
acagcaagat 
ggcaagactg 
actgtgccgc 
cgggcccctc 
tctataacaa 
aaaccaccga 
acagccctca 
tcattgccta 
acagctgccg 
cgtatgacga 
acgtggggac 
gaaggaccct 
ccatcaacat 
gcgccctggc 
gcattgcctt 
ggttcaacca 
ccgtgtccga 
cagactgcat 
tgtacattca 
ccagccaccc 
aaccggttgt 
aactcgccat 
acccgcgagg 
cgtccaagac 
ctggcaggag 
ctcagaacga 
ttaaagctct 
agttcaccga 
tgaaaacaaa 
acctctggaa 
atggcatcca 
tcaggaactc 
ctgacaattc 
ccagagtgct 
tcgttggctt 
aagccaaaat 
agctgccgcc 



ccttcaccca 
tggagttatt 
tgacacctat 
cgatggcagg 
ggccaggaag 
cccttggagt 
atatcatgga 
tggcgaatat 
gtggttcgat 
gaaagctcac 
tggagttaac 
ctacgaccgg 
gaggcccaaa 
ggataatgta 
catgtaccag 
caaagtggca 
gcgggcggag 
atgctacccc 
catcaagttt 
gggacagcag 
aaggggaggt 
tcgctgcgtc 
ctctttgggc 
ctgtcttggc 
gtgcaagatg 
caatgctgtg 
tgcaggatct 
cgtgggaatg 
ccgagcacat 
ggcctctgcc 
ccaggacgcc 
caagaaccag 
gtttgctgac 
cggctccaag 
ggaaatgatg 
ccctataggc 
ccaaaactgc 
cttccgcctg 
tgaggacgtt 
gctggacatg 
gtaccctggc 
caatgttccc 
agcctacaag 
tctttacctg 
caccctgcag 
ctggctcatc 
caccacattc 
gggcgtcttc 
ccactactac 
gagagagaag 
gattccaaag 
gagggctgtc 
ggaccatttc 
cgacttcgct 
ggtggtggtg 
cattctgcaa 
catagtgctt 
ggaaaagctt 
caaaggcagc 
cttccaagtt 
cggtgccacc 



ggtggcatgg 
gttcatgtca 
agatccaaga 
atcctttctg 
gcgatgacca 
tttctaactg 
catcgaggct 
ttcaatgttt 
catgataaag 
ccaggaaaaa 
ctcagcaccg 
ggcagagcct 
ctcacagtca 
cagtcatgga 
gcagaagagt 
gggaaaccaa 
gttgggcttc 
tacagaaacc 
gctctgggat 
ctggtgggtc 
tatgacccac 
acagtccatg 
cactgcttct 
ctccttgtca 
atcacagagg 
tccaccttct 
gaggaaactg 
tactccccag 
tccaactacc 
aaggacaagc 
gacccgctga 
gaccacgggg 
aatggcattg 
caagagataa 
gacaatagga 
cagaattttc 
actttccgaa 
aataatgcct 
ccgattactt 
gatggggata 
tcctacctca 
gactggagag 
accagtaacc 
gagggggcgc 
aagggctaca 
aacttcaaca 
tccatcctct 
gtgaggacct 
tgggacgagg 
tttgctttct 
aacgcaggcg 
gtagacgtgc 
ttggaggtga 
tacattgaag 
attgacggga 
ggcataccat 
atggcatcaa 
ggggcagaca 
ttccggccca 
gtgcccatcc 
tcgtggtaga 



780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900 

3960 

4020 

4080 

4140 

4200 

4260 

4320 

4380 
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ctatgacggt gactcttggc 
cagcagctgc ctgggaaggc 
accctggtgc tgccacctgc 
ggccaatgct ggaaacattc 
cctcttcagt gggggtttgg 
tccactttgg caggagccct 
gatccccatg gtcttcagca 
cttaaggaaa tctttactcc 
agaactggct atccttgggg 
agactttgag tggcaggttt 
gagattccag aaatctgctg 
ccactgatat ccatgatgct 
tgcctagctt gaggggtctg 
atgccaggtg gagaaatcac 
gctcaggaag gcttcttgct 
gaggcagcct ctggaatggc 
gtgactacgg ggtcgccctt 
gtcagagggg agcaatgggc 
gccctgcctc tgactccaag 
catttgctct tcatccaggg 
cgtactccct cggcctggga 
ttcattttaa cagatgggga 
gagggcctgg ggagccccac 
ccccagagtg cccaggcact 
ggaaaggaaa tgactagagt 
gcacacaaac ccgccctccc 
acctgtcagc ccagcctggg 
tcctgtctct gcagctctac 
tggtgagcca atttggctga 
tttcagctgc tgcttaatgc 
gctcctgtaa gagggagaac 
tccctgggtc ttgtgatgaa 
ccttcaaaga gggcctgcct 
gagtccattc cccaggtggg 
agctgctggg aggtgaccat 
agtcacagga aggacttctt 
cctcatgtcc ttcttgtcca 
acactgtgaa ccacttagga 
ggctcagttc atttaaaaaa 
acagtacagg atctgtacat 
taggcatttt cttggtagca 
ttctgtttgt aagacttaag 
cttgtctttt ttctgttgcc 
gtttgtatgt aaacatttct 
ttattatatg tgcacttcaa 



agcagaccag tgggggatgg 
cgtgtttcag ccctgatggg 
ccctactcaa gtgtctacct 
actttcctgc agcctcttgg 
ggaccatatc aggagacctg 
gacccagcta ggaggtagtc 
gacaagtgag ggtggtaaat 
tgtaagcaag agccaacctc 
aagaggcaag ccctgcctct 
ggacttggac tagatgactc 
catttcacat ggtacctgga 
gggtgcccca gcgcacacgg 
cagtccagta gggcaggcag 
agagaggtaa aatggaggcc 
tacaggaatg aaggctgggg 
tcagggattc agccctccct 
tgctcacgtc tctctggccc 
tttgctgctt atgagcacag 
agggtgaagt ccacagaagt 
aactgagcac agggggcctc 
tttcagagct ggaaatatag 
aagtgagccc ccaagatggg 
cctagccctt gctgccacac 
cctgaggtag cttctggaaa 
agaatgacag ctagcagatc 
cttggtgttg gcggtccctg 
tgcacagtag ctgcaactcc 
aggtgaggcc cagcagaggg 
tcttgggtgt ctgaacagct 
cctgctctct ccctggccca 
tctatctgtg gtttataatc 
ctacatttat cccctttcct 
ggctccctcc acccaactgc 
agccaactgt cagggaggtc 
agggc t c t gc 1 1 1 1 aaaga t 
ccagggagat tagtggtgat 
cggttttgtt gagttttcac 
tgtgatcact ttcaggtggc 
gatatctatt tgaaagttct 
aaaagtttct ttcctaaacc 
caaattttct tattgcttag 
tgagttaggt ctttaaggaa 
gaaatagctg gtcctttttc 
tgtaggcatc accatgaaca 
gaagtcactg tcagagaaat 



ctgggtcccc cagcccctgc 4440 
ccaagggaag gctatcagag 4500 
ggagcccctg gggcggtgct 4 560 
gtgcttctct cctatctgtg 4620 
ggttgtgctg acagcaaaga 4680 
tggagggctg gtcattcaca 4740 
gtaggagaaa gagccttggc 4 800 
acaggattag gagctggggt 4860 
ggccgtgtcc acctttcagg 4920 
tcaaaggccc ttttagttct 4980 
acccaacagt tcatggatat 5040 
gatggagagg tgagaactaa 5100 
tcaggtccat gtgcactgca 5160 
agtgccattt cagaggggag 522 0 
gcattttgct ggggggagat 52 80 
gccgctgcct gctgaagctg 5340 
actcatgatg gagaagtgtg 5400 
aggaattcag tccccaggca 5460 
gagctcctgc cttagggcct 5520 
caggagaccc tagatgtgct 5580 
aaaatatcta gcccaaagcc 564 0 
aaagaaccac acagctaagg 5700 
cacattgcct caacaaccgg 5760 
tggggacaag tcccctcgaa 582 0 
tcttccctcc tgctcccagc 5880 
tggccttcac tttgttcact 5940 
ccattggtgc tacctggctc 6000 
agtagggctc gccatgtttc 6060 
attgggtcca ccccagtccc 6120 
ccttatagag agcccaaaga 6180 
ttgcacgagg caccagagtc 6240 
gccccaacca caaactcttt 6300 
acccatgaga ctcggtccaa 6360 
tttcccacca aacatctttc 6420 
atggctgctt caaaggccag 64 80 
ggagaggaga gttaaaatga 654 0 
tcttctaatg caagggtctc 6600 
caggaatgtt gaatgtcttt 6660 
cagagttgta catatgtttc 6720 
attcaccaag agccaatatc 6780 
aaaattgtcc tccttgttat 6840 
agcaacgctc ctctgaaatg 6900 
gggagttaga tgtatagagt 6960 
aagatatatt ttctatttat 7020 
aaagaattgt cttaaatgtc 7080 



<210> 16 

<211> 1361 

<212> PRT 

<213> Homo sapiens 

<220> 

<223> KIAA1199 
<400> 16 

Met Gly Ala Ala Gly Arg Gin Asp Phe Leu Phe Lys Ala Met Leu Thr 
15 10 15 

lie Ser Trp Leu Thr Leu Thr Cys Phe Pro Gly Ala Thr Ser Thr Val 
20 25 30 
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Ala Ala Gly Cys 
35 

Gly His' Asp Gin 
50 

Leu Leu Thr Ser 
65 

Gly Lys Leu Val 



Arg His lie Leu 
100 

Leu Cys Pro Phe 
115 

Asp Glu Gly lie 
130 

Val Gly Lys Gly 
145 

Trp Thr Phe Leu 



Gly Tyr Phe Phe 
180 

Val lie Asp Pro 
195 

Thr Tyr Arg Ser 
210 

Ala Val Pro Asp 
225 

Ser Arg Asn Leu 



Ser Lys His Phe 
260 

Thr Val Lys Gly 
275 

His Gly His Arg 
290 

Thr Glu His Gly 
305 

Gin Asp Val Glu 



Thr Lys Gly Gly 
340 



Pro Asp Gin Ser 
40 

Asp His His Val 
55 

Ser Ala Thr Val 
70 

lie Lys Asp His 
85 

lie Asp Asn Gly 



Gin Gly Asn Phe 
120 

Gin Pro Asp Pro 
135 

Gly Ala Leu Glu 
150 

Asn Lys Thr Leu 
165 

Glu Arg Ser Trp 



Lys Ser Gly Thr 
200 

Lys Lys Glu Ser 
215 

Gly Arg lie Leu 
230 

Asp Asp Met Ala 
245 

Leu His Leu Gly 



Asn Pro Ser Ser 
280 

Gly Ser Ala Ala 
295 

Glu Tyr Phe Asn 
310 

Trp Thr Glu Trp 
325 

Glu Lys lie Ser 



Pro Glu Leu Gin 



His lie Gly Gin 
60 

Tyr Ser lie His 
75 

Asp Glu Pro lie 
90 

Gly Glu Leu His 
105 

Thr lie lie Leu 



Tyr Tyr Gly Leu 
140 

Leu His Gly Gin 
155 

His Pro Gly Gly 
170 

Gly His Arg Gly 
185 

Val He His Ser 



Glu Arg Leu Val 
220 

Ser Val Ala Val 
235 

Arg Lys Ala Met 
250 

Phe Arg His Pro 
265 

Ser Val Glu Asp 



Ala Arg Val Phe 
300 

Val Ser Leu Ser 
315 

Phe Asp His Asp 
330 

Asp Leu Trp Lys 
345 



Pro Trp Asn Pro 
45 

Gly Lys Thr Leu 



He Ser Glu Gly 
80 

Val Leu Arg Thr 

.95 

Ala Gly Ser Ala 
110 

Tyr Gly Arg Ala 
125 

Lys Tyr He Gly 



Lys Lys Leu Ser 
160 

Met Ala Glu Gly 
175 

Val He Val His 
190 

Asp Arg Phe Asp 
205 

Gin Tyr Leu Asn 



Asn Asp Glu Gly 
240 

Thr Lys Leu Gly 
255 

Trp Ser Phe Leu 

270 

His He Glu Tyr 
285 

Lys Leu Phe Gin 



Ser Glu Trp Val 
320 

Lys Val Ser Gin 
335 

Ala His Pro Gly 
350 
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Lys lie Cys Asn 
355 

Val Asn Leu Ser 
370 

Phe Ala Cys Tyr 
385 

Phe Leu Cys Gly 



Thr Asn Val Asn 
420 

Trp Lys Pro Gly 
435 

Tyr Gin Ala Glu 
450 

Asn Gin Val Lys 
465 

Glu lie Asp Gly 



Asn lie He Val 
500 

Asn His lie Cys 
515 

Lys Phe Ala Leu 
530 

Lys His Met Gly 
545 

Leu Ala Gly Asp 



He Arg Asp Leu 
580 

His Gly Ser Asn 
595 

Leu Gly His Cys 
610 

Phe Asp His Cys 
625 

Ser Asp Arg Asp 



Pro Gly Tyr He 
660 



Arg Pro He Asp 
360 

Thr Glu Val Val 
375 

Asp Arg Gly Arg 
390 

Lys Pro Val Arg 
405 

Ser Thr He Leu 



Asp Thr Leu Val 
440 

Glu Phe Gin Val 
455 

Val Ala Gly Lys 
470 

Val Asp Met Arg 
485 

Met Gly Glu Met 



Asn Phe Phe Asp 
520 

Gly Phe Lys Ala 
535 

Gin Gin Leu Val 
550 

Val Asp Glu Arg 
565 

Ser He His His 



Gly Leu Leu He 
600 

Phe Phe Thr Glu 
615 

Leu Gly Leu Leu 
630 

Ser Lys Met Cys 
645 

Pro Lys Pro Arg 



He Gin Ala Thr 



Tyr Lys Lys Gly 
380 

Ala Cys Arg Ser 
395 

Pro Lys Leu Thr 
410 

Asn Leu Glu Asp 
425 

He Ala Ser Thr 



Leu Pro Cys Arg 
460 

Pro Met Tyr Leu 
475 

Ala Glu Val Gly 
490 

Glu Asp Lys Cys 
505 

Phe Asp Thr Phe 



Ala His Leu Glu 
540 

Gly Gin Tyr Pro 
555 

Gly Gly Tyr Asp 
570 

Thr Phe Ser Arg 
585 

Lys Asp Val Val 



Asp Gly Pro Glu 
620 

Val Lys Ser Gly 
635 

Lys Met He Thr 
650 

Gin Asp Cys Asn 
665 



Thr Met Asp Gly 
365 

Gin Asp Tyr Arg 



Tyr Arg Val Arg 
400 

Val Thr He Asp 
415 

Asn Val Gin Ser 
430 

Asp Tyr Ser Met 
445 

Ser Cys Ala Pro 



His He Gly Glu 
480 

Leu Leu Ser Arg 
495 

Tyr Pro Tyr Arg 
510 

Gly Gly His He 
525 

Gly Thr Glu Leu 



He His Phe His 
560 

Pro Pro Thr Tyr 
575 

Cys Val Thr Val 
590 

Gly Tyr Asn Ser 
605 

Glu Arg Asn Thr 



Thr Leu Leu Pro 
640 

Glu Asp Ser Tyr 
655 

Ala Val Ser Thr 
670 
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Phe Trp Met Ala 
675 

Gly Ser Glu Glu 
690 

Gly Pro Ser Val 
705 

Leu Gly Lys Phe 



Met lie lie Asp 
740 

Lys Arg Pro Phe 
755 

Asp Ala Asp Pro 
770 

lie Ala Tyr Lys 
785 

Val Trp Leu Asp 



Leu Ala Ser Gly 
820 

lie Lys Asn Ser 
835 

Met Met Asp Asn 
850 

Arg Thr Leu Pro 
865 

Tyr Asp Gly Pro 



Ala Leu Glu Gly 
900 

Ala Trp Gin Ser 
915 

Asp Val Pro lie 
930 

Phe Asn Gin Leu 
945 

Val Asp Gly Ser 



Asp Asn Trp Leu 
980 



Asn Pro Asn Asn 
680 

Thr Gly Phe Trp 
695 

Gly Met Tyr Ser 
710 

Tyr Asn Asn Arg 
725 

Asn Gly Val Lys 



Leu Ser lie lie 
760 

Leu Lys Pro Arg 
775 

Asn Gin Asp His 
790 

Ser Cys Arg Phe 
805 

Gly Thr Phe Pro 



Leu Phe Val Gly 
840 

Arg lie Trp Gly 
855 

lie Gly Gin Asn 
870 

lie Asn lie Gin 
885 

Arg His Thr Ser 



Cys Pro His Asn 
920 

Thr Ser Arg Val 
935 

Asp Met Asp Gly 
950 

Val Ser Glu Tyr 
965 

Val Arg His Pro 



Asn Leu lie Asn 



Phe lie Phe His 
700 

Pro Gly Tyr Ser 
715 

Ala His Ser Asn 
730 

Thr Thr Glu Ala 
745 

Ser Ala Arg Tyr 



Glu Pro Ala lie 
780 

Gly Ala Trp Leu 
795 

Ala Asp Asn Gly 
810 

Tyr Asp Asp Gly 
825 

Glu Ser Gly Asn 



Pro Gly Gly Leu 
860 

Phe Pro lie Arg 
875 

Asn Cys Thr Phe 
890 

Ala Leu Ala Phe 
905 

Asn Val Thr Gly 



Phe Phe Gly Glu 
940 

Asp Lys Thr Ser 
955 

Pro Gly Ser Tyr 
970 

Asp Cys lie Asn 
985 



Cys Ala Ala Ala 
685 

His Val Pro Thr 



Glu His lie Pro 
720 

Tyr Arg Ala Gly 
735 

Ser Ala Lys Asp 

750 

Ser Pro His Gin 
765 

lie Arg His Phe 



Arg Gly Gly Asp 
800 

lie Gly Leu Thr 
815 

Ser Lys Gin Glu 
830 

Val Gly Thr Glu 
845 

Asp His Ser Gly 



Gly lie Gin Leu 
880 

Arg Lys Phe Val 
895 

Arg Leu Asn Asn 
910 

He Ala Phe Glu 
925 

Pro Gly Pro Trp 



Val Phe His Asp 
960 

Leu Thr Lys Asn 
975 

Val Pro Asp Trp 
990 
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Arg Gly Ala lie Cys Ser Gly Cys Tyr Ala Gin Met Tyr lie Gin Ala 
995 1000 1005 



Tyr Lys Thr Ser Asn Leu Arg Met Lys lie lie Lys Asn Asp Phe Pro 
1010 1015 1020 

Ser His Pro Leu Tyr Leu Glu Gly Ala Leu Thr Arg Ser Thr His Tyr 
1025 1030 1035 1040 

Gin Gin Tyr Gin Pro Val Val Thr Leu Gin Lys Gly Tyr Thr lie His 
1045 1050 1055 

Trp Asp Gin Thr Ala Pro Ala Glu Leu Ala lie Trp Leu lie Asn Phe 
1060 1065 1070 

Asn Lys Gly Asp Trp lie Arg Val Gly Leu Cys Tyr Pro Arg Gly Thr 
1075 1080 1085 

Thr Phe Ser lie Leu Ser Asp Val His Asn Arg Leu Leu Lys Gin Thr 
1090 1095 1100 

Ser Lys Thr Gly Val Phe Val Arg Thr Leu Gin Met Asp Lys Val Glu 
1105 1110 1115 1120 

Gin Ser Tyr Pro Gly Arg Ser His Tyr Tyr Trp Asp Glu Asp Ser Gly 
1125 1130 1135 

Leu Leu Phe Leu Lys Leu Lys Ala Gin Asn Glu Arg Glu Lys Phe Ala 
1140 1145 1150 

Phe Cys Ser Met Lys Gly Cys Glu Arg lie Lys lie Lys Ala Leu lie 
1155 1160 1165 

Pro Lys Asn Ala Gly Val Ser Asp Cys Thr Ala Thr Ala Tyr Pro Lys 
1170 1175 1180 

Phe Thr Glu Arg Ala Val Val Asp Val Pro Met Pro Lys Lys Leu Phe 
1185 1190 1195 1200 

Gly Ser Gin Leu Lys Thr Lys Asp His Phe Leu Glu Val Lys Met Glu 
1205 1210 1215 

Ser Ser Lys Gin His Phe Phe His Leu Trp Asn Asp Phe Ala Tyr lie 
1220 1225 1230 

Glu Val Asp Gly Lys Lys Tyr Pro Ser Ser Glu Asp Gly lie Gin Val 
1235 1240 1245 

Val Val lie Asp Gly Asn Gin Gly Arg Val Val Ser His Thr Ser Phe 
1250 1255 1260 

Arg Asn Ser lie Leu Gin Gly lie Pro Trp Gin Leu Phe Asn Tyr Val 
1265 1270 1275 1280 

Ala Thr lie Pro Asp Asn Ser lie Val Leu Met Ala Ser Lys Gly Arg 
1285 1290 1295 

Tyr Val Ser Arg Gly Pro Trp Thr Arg Val Leu Glu Lys Leu Gly Ala 
1300 1305 1310 
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Asp Arg Gly Leu Lys Leu Lys Glu Gin Met Ala Phe Val Gly Phe Lys 

1315 1320 1325 

Gly Ser Phe Arg Pro lie Trp Val Thr Leu Asp Thr Glu Asp His Lys 

1330 1335 1340 

Ala Lys lie Phe Gin Val Val Pro lie Pro Val Val Lys Lys Lys Lys 

1345 1350 1355 1360 

Leu 



<210> 17 
<211> 1619 
<212> DNA 

<213> Homo- sapiens 
<220> 

<223> type I intermediate filament cytoketatin (HAIKl) 



<400> 17 

agttctgcgg 

atgaactccg 

ggctggggcc 

gcccgcatct 

tctggaagaa 

gaccgcctgg 

gaaagccgca 

cagtatgagg 

gctcagatta 

tatgaaaatg 

accttagaca 

aaagagctca 

agtgacttca 

ctggaggata 

acttggtata 

cagagcagac 

ctgcaggcac 

cggtactcct 

acgcagctac 

aaaacccacc 

gggacacggg 

ataacccagg 

cacgcatgag 

tccttgcaca 

aaacttttat 

tattgcactt 

agccttcaga 



tgccagggag 
gacacagctt 
ggcccaggag 
ccctgtcctt 
gcagccccct 
cctcctacct 
tcctgaaatg 
aaaacatcac 
ttcttctcat 
aacactcctt 
acctgaccat 
ttctcatgaa 
atgtcaatgt 
tgagacaaga 
aagaacagtc 
aaggtgacat 
agtacagcac 
gcaagctcca 
gccacgaact 
tggagaagga 
aagaatcaaa 
agaccatcaa 
accaatgaaa 
gacaccagtg 
tttttttttt 
ttctaatcaa 
ttctcatcat 



tggagcagag 
cagccagacc 
cttccccagg 
caccacgcgg 
actaggcgga 
ggagaaggtt 
gcaccagcag 
acacctgcag 
tgacaatgcc 
taagaaagac 
tgtcacaaca 
ggagcaccat 
gaaggtggat 
atatgagctt 
tgcagccatg 
ccacgaactg 
gaaatctgct 
ggacatgcaa 
ggagcggcag 
aatcaccacg 
gtcgagcatg 
cggaagatta 
gtttccgcct 
agtgagttct 
ctgtaacagt 
agtgcgagtt 
tttgcatcta 



ctcagccccg 
ccctcggcct 
gctcccaccg 
agctgcccac 
aatgggaagg 
cgcgccctgg 
agagatcctg 
gagcagatag 
aggatggcag 
ttggaaattg 
gacctagaac 
gagcaggaaa 
acaggtccca 
ataataaaga 
tcccaggagg 
aagcgcacat 
ttggaaaaca 
gagatcatct 
aacaatgaat 
taccgacggc 
aaagtgtctg 
gttctttgtc 
gttgtaaagt 
aaaagatacc 
ctcaccagac 
tatgagggta 
ttttgtagcc 



tcccaaacac 
ccttccatgg 
tccatggcgg 
cccctggagg 
ccaccatgca 
aggaggccaa 
gcagtaagaa 
tggatggtaa 
tggatgactt 
aagtcgaggg 
aggaggtgga 
tggaggagca 
gggaagatct 
agaagcatcg 
cagccagtcc 
tccaggccct 
tgttatccga 
cccactatga 
accaagtgct 
tcctggaggg 
caactccaaa 
aagtgaatga 
ctattttccc 
cttggaatta 
ttctcataat 
aagctctact 
aataaaactc 



agatgggacc 
cgccggaggt 
tgcgggggga 
gtcttggggt 
gaatctcaac 
catgaagctg 
agattattcc 
gatgaccaat 
caacctcaag 
cctccgaagg 
aggaatgagg 
tcatgtgcca 
gattaaggtc 
agacttggac 
agccactgtg 
ggagattgac 
gacccagtct 
ggaggaactg 
gctgggcatc 
agagagtgaa 
gatcaaggcc 
aatccaaaag 
ccaaggaaag 
tcagactcag 
gctcttaata 
ttcctactgc 
cgcactagc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

72 0 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1619 



<210> 18 
<211> 422 
<212> PRT 

<213> Homo sapiens 
<220> 

<223> type I intermediate filament cytokeratin (HAIKl) 
<400> 18 

Met Asn Ser Gly His Ser Phe Ser Gin Thr Pro Ser Ala Ser Phe His 
15 10 15 
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Gly Ala Gly Gly 
20 

Thr Val His Gly 
35 

Thr Arg Ser Cys 
50 

Ser Pro Leu Leu 
65 

Asp Arg Leu Ala 



Asn Met Lys Leu 
100 

Pro Gly Ser Lys 
115 

Leu Gin Glu Gin 
130 

Leu Leu lie Asp 
145 

Tyr Glu Asn Glu 



Gly Leu Arg Arg 
180 

Glu Gin Glu Val 
195 

His His Glu Gin 
210 

Val Asn Val Lys 
225 

Leu Glu Asp Met 



Arg Asp Leu Asp 
260 

Glu Ala Ala Ser 
275 

Glu Leu Lys Arg 
290 

Tyr Ser Thr Lys 
305 

Arg Tyr Ser Cys 



Gly Trp Gly Arg 



Gly Ala Gly Gly 
40 

Pro Pro Pro Gly 
55 

Gly Gly Asn Gly 
70 

Ser Tyr Leu Glu 
85 

Glu Ser Arg lie 



Lys Asp Tyr Ser 
120 

lie Val Asp Gly 
135 

Asn Ala Arg Met 
150 

His Ser Phe Lys 
165 

Thr Leu Asp Asn 



Glu Gly Met Arg 
200 

Glu Met Glu Glu 
215 

Val Asp Thr Gly 
230 

Arg Gin Glu Tyr 
245 

Thr Trp Tyr Lys 



Pro Ala Thr Val 
280 

Thr Phe Gin Ala 
295 

Ser Ala Leu Glu 
310 

Lys Leu Gin Asp 
325 



Pro Arg Ser Phe 
25 

Ala Arg lie Ser 



Gly Ser Trp Gly 
60 

Lys Ala Thr Met 
75 

Lys Val Arg Ala 
90 

Leu Lys Trp His 
105 

Gin Tyr Glu Glu 



Lys Met Thr Asn 
14 0 

Ala Val Asp Asp 
155 

Lys Asp Leu Glu 
170 

Leu Thr lie Val 
185 

Lys Glu Leu lie 



His His Val Pro 
220 

Pro Arg Glu Asp 
235 

Glu Leu lie lie 
250 

Glu Gin Ser Ala 
265 

Gin Ser Arg Gin 



Leu Glu lie Asp 
300 

Asn Met Leu Ser 
315 

Met Gin Glu lie 
330 



Pro Arg Ala Pro 
30 

Leu Ser Phe Thr 
45 

Ser Gly Arg Ser 



Gin Asn Leu Asn 
80 

Leu Glu Glu Ala 
95 

Gin Gin Arg Asp 
110 

Asn lie Thr His 
125 

Ala Gin He He 



Phe Asn Leu Lys 
160 

He Glu Val Glu 
175 

Thr Thr Asp Leu 
190 

Leu Met Lys Glu 
205 

Ser Asp Phe Asn 



Leu He Lys Val 
240 

Lys Lys Lys His 
255 

Ala Met Ser Gin 
270 

Gly Asp He His 
285 

Leu Gin Ala Gin 



Glu Thr Gin Ser 
320 

He Ser His Tyr 
335 
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Glu Glu Glu Leu Thr 
340 

Glu Tyr Gin Val Leu 
355 

Thr Thr Tyr Arg Arg 
370 

Glu Ser Lys Ser Ser 
385 

He Thr Gin Glu Thr 
405 

Glu lie Gin Lys His 
420 



Gin Leu Arg His Glu 
345 

Leu Gly lie Lys Thr 
360 

Leu Leu Glu Gly Glu 
375 

Met Lys Val Ser Ala 
390 

He Asn Gly Arg Leu 
410 

Ala 



Leu Glu Arg Gin Asn Asn 
350 

His Leu Glu Lys Glu lie 
365 

Ser Glu Gly Thr Arg Glu 
380 

Thr Pro Lys He Lys Ala 
395 400 

Val Leu Cys Gin Val Asn 
415 



<210> 19 

<211> 877 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> modif ied_base 

<222> (1) . . (877) 

<223> n = g, a, c or t 



<400> 19 

tttttttttt 

ttatccagca 

gactcaaagg 

nnnnacccat 

aaataaggta 

ataattatac 

tacaaggaca 

cattaattta 

ggcttgataa 

aaaggatgtt 

cataactatt 

tgcttgcaaa 

ttcagatgaa 

caggtttgga 

tggccttcaa 



ttaaaaaaaa 
tttgtattat 
gaacatataa 
cgtgcgatga 
tgggaaaaac 
atgtttggct 
gaaaacccac 
cttctcttag 
tttatttata 
acaatttact 
ttatagagat 
atagactttt 
ttagacagta 
ggcctcttct 
cccaagccag 



gaggcttggt 
gaaccagtga 
atgtttccta 
tcnnnnnnnn 
agatgttttc 
gtaataacta 
aacaacaaag 
acccgggaca 
tgttctagag 
tcaaaataat 
ggattatcat 
cctattggga 
catttttcag 
ttgtctttgc 
gagtttggct 



aagtttttga 
gtactgtaat 
tttttnnnnn 
nnnnnnnnnn 
attatcgcca 
tactaaagca 
atcgatcacg 
tgtgggacaa 
tctgaggatt 
acaatcatgg 
acatgggatt 
ggaacatctt 
gagaaccagc 
aaccataacc 
caaatga 



tacttagttg 
ttttctttcc 
nnnnnnnnnn 
nnnnnttggg 
cttaatcctt 
tgcttgtgaa 
aaagacaagg 
atacttttgt 
ttctttcagt 
tttaatttac 
ataaaaataa 
ttaacctaaa 
cttactggat 
ccttttcagc 



acttttagca 
ctttcagaaa 
nnnnnnnnnn 
atccagtttc 
acttccgatt 
agtagacttc 
catattcatt 
cctcatggat 
ggcagacaac 
agtgtaaatc 
cttacccata 
acggatttat 
gatcttttgt 
tgaagaccac 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

877 



<210> 20 
<211> 2124 
<212> DNA 

<213> Homo sapiens 

<220> 

<223> glycinamide ribonucleotide 
<400> 20 

gaattcgaac caggtggcca cccggtgtcg 
cagacagaac aatggcagcc cgagtactta 
tggcctggaa acttgcacag tctcatcatg 
caggcactgc ctgctctgaa aagatttcaa 
cccttgctca attctgcaaa gagaagaaaa 



synthetase (GARS) 



gtttcatttt cctttggaat ttctgcttta 60 
taattggcag tggaggaagg gaacatacgc 12 0 
tcaaacaagt gttggttgcc ccaggaaacg 180 
ataccgccat ctcaatcagt gaccacactg 24 0 
ttgaatttgt agttgttgga ccagaagcac 300 
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ctctggctgc tgggattgtt gggaacctga ggtctgcagg agtgcaatgc tttggcccaa 360 
cagcagaagc ggctcagtta gagtccagca aaaggtttgc caaagagttt atggacagac 42 0 
atggaatccc aaccgcacaa tggaaggctt tcaccaaacc tgaagaagcc tgcagcttca 4 80 
ttttgagtgc agacttccct gctttggttg tgaaggccag tggtcttgca gctggaaaag 540 
gggtgattgt tgcaaagagc aaagaagagg cctgcaaagc tgtacaagag atcatgcagg 600 
agaaagcctt tggggcagct ggagaaacaa ttgtcattga agaacttctt gacggagaag 660 
aggtgtcgtg tctgtgtttc actgatggca agactgtggc ccccatgccc ccagcacagg 720 
accataagcg attactggag ggagatggtg gccctaacac agggggaatg ggagcctatt 780 
gtccagcccc tcaggtttct aatgatctat tactaaaaat taaagatact gttcttcaga 84 0 
ggacagtgga tggcatgcag caagagggta ctccatatac aggtattctc tatgctggaa 900 
taatgctgac caagaatggc ccaaaagttc tagagtttaa ttgccgtttt ggtgatccag 960 
agtgccaagt aatcctccca cttcttaaaa gtgatcttta tgaagtgatt cagtccacct 1020 
tagatggact gctctgcaca tctctgcctg tttggctaga aaaccacacc gccctaactg 1080 
ttgtcatggc aagtaaaggt tatcctggag actacaccaa gggtgtagag ataacagggt 114 0 
ttcctgaggc tcaagctcta ggactggagg tgtcccatgc aggcactgcc ctcaaaaatg 12 0 0 
gcaaagtagt aactcatggg ggtagagttc ttgcagtcac agccatccgg gaaaatctca 1260 
tatcagccct tgaggaagcc aagaaaggac tagctgctat aaagtttgag ggagcaattt 132 0 
ataggaaaga catcggcttt cgtgccatag ctttcctcca gcagcccagg taaaactcta 1380 
agcaagttag ctgtagtgcc atttcagaaa ctggcctaaa tggctatgta gaacattcca 144 0 
ttaaccctat aagtcattca gtattctttt ctctctgtgg gagtgataca gtcttggttt 1500 
gtattttgtt tgaatcaaaa ctggttatag caatactcaa atggaaaaaa cttcatgata 1560 
gcgtaagttt ggaaagttta gcaaaatcac agtggtactg atttttattt gttttctatt 1620 
ttttttattt tatattttta atttttttaa cagggtcttc ctctctcgcc caagttctca 1680 
tgcctcagcc tcccaaatag ctgggactac aggcacaggc caccacacct ggctaatttt 1740 
tttgtatttt ttgtggagat ggggttcacc atgttgccaa ggccagtctg aaagcctggg 1800 
ctcaagtgat cctcctgctt tggcctccca aaatgctggg actataggca tgaggcgctg 1860 
cacttggcct gatactgatt tttattcctt gcgttatcac atagtgttgt atttgaaaca 1920 
tagttcatgg ttttatcaaa gaactgaaga tgagaatact ggtcatctaa ctttgtaatt 1980 
tgatttgatt atactgtaaa gtttgacagt cccattttaa cctgcgtttg tatctattac 2040 
taaaatgtat tttttgacct cttactgatt catggttggt atgtacaaac tgttgacttg 2100 
taaaatcaat aaagtcttag ttgg 2124 



<210> 21 

<211> 433 

<212> PRT 

<213> Homo sapiens 



<220> 

<223> glycinamide ribonucleotide synthetase (GARS) 
<400> 21 

Met Ala Ala Arg Val Leu lie lie Gly Ser Gly Gly Arg Glu His Thr 
15 10 15 

Leu Ala Trp Lys Leu Ala Gin Ser His His Val Lys Gin Val Leu Val 
20 25 30 

Ala Pro Gly Asn Ala Gly Thr Ala Cys Ser Glu Lys He Ser Asn Thr 
35 40 45 

Ala He Ser He Ser Asp His Thr Ala Leu Ala Gin Phe Cys Lys Glu 
50 55 60 

Lys Lys He Glu Phe Val Val Val Gly Pro Glu Ala Pro Leu Ala Ala 
65 70 75 80 

Gly He Val Gly Asn Leu Arg Ser Ala Gly Val Gin Cys Phe Gly Pro 
85 90 95 
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Thr Ala Glu Ala 
100 

Phe Met Asp Arg 
115 

Lys Pro Glu Glu 
130 

Leu Val Val Lys 
145 

Ala Lys Ser Lys 



Glu Lys Ala Phe 
180 

Leu Asp Gly Glu 
195 

Val Ala Pro Met 
210 

Asp Gly Gly Pro 
225 

Gin Val Ser Asn 



Arg Thr Val Asp 
260 

Leu Tyr Ala Gly 
275 

Phe Asn Cys Arg 
290 

Leu Lys Ser Asp 
305 

Leu Cys Thr Ser 



Val Val Met Ala 
340 

Glu He Thr Gly 
355 

His Ala Gly Thr 
370 

Arg Val Leu Ala 
385 

Glu Glu Ala Lys 



Ala Gin Leu Glu 



His Gly lie Pro 
120 

Ala Cys Ser Phe 
135 

Ala Ser Gly Leu 
150 

Glu Glu Ala Cys 
165 

Gly Ala Ala Gly 



Glu Val Ser Cys 
200 

Pro Pro Ala Gin 
215 

Asn Thr Gly Gly 
230 

Asp Leu Leu Leu 
245 

Gly Met Gin Gin 



He Met Leu Thr 
280 

Phe Gly Asp Pro 
295 

Leu Tyr Glu Val 
310 

Leu Pro Val Trp 
325 

Ser Lys Gly Tyr 



Phe Pro Glu Ala 
360 

Ala Leu Lys Asn 
375 

Val Thr Ala He 
390 

Lys Gly Leu Ala 
405 



Ser Ser Lys Arg 
105 

Thr Ala Gin Trp 



He Leu Ser Ala 
140 

Ala Ala Gly Lys 
155 

Lys Ala Val Gin 
170 

Glu Thr He Val 
185 

Leu Cys Phe Thr 



Asp His Lys Arg 
220 

Met Gly Ala Tyr 
235 

Lys He Lys Asp 
250 

Glu Gly Thr Pro 
265 

Lys Asn Gly Pro 



Glu Cys Gin Val 
300 

He Gin Ser Thr 
315 

Leu Glu Asn His 
330 

Pro Gly Asp Tyr 
345 

Gin Ala Leu Gly 



Gly Lys Val Val 
380 

Arg Glu Asn Leu 
395 

Ala He Lys Phe 
410 



Phe Ala Lys Glu 
110 

Lys Ala Phe Thr 
125 

Asp Phe Pro Ala 



Gly Val He Val 
160 

Glu He Met Gin 
175 

He Glu Glu Leu 
190 

Asp Gly Lys Thr 
205 

Leu Leu Glu Gly 



Cys Pro Ala Pro 
240 

Thr Val Leu Gin 
255 

Tyr Thr Gly He 
270 

Lys Val Leu Glu 
285 

He Leu Pro Leu 



Leu Asp Gly Leu 
320 

Thr Ala Leu Thr 
335 

Thr Lys Gly Val 
350 

Leu Glu Val Ser 
365 

Thr His Gly Gly 



He Ser Ala Leu 
400 

Glu Gly Ala He 
415 
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Tyr Arg Lys Asp lie Gly Phe Arg Ala lie Ala Phe Leu Gin Gin Pro 
420 425 430 



Arg 



<210> 22 

<211> 2424 

<212> DNA 

<213> Homo sapiens 



<400> 22 

atgccccctt 

tctctccctc 

agcaaaatga 

agcgtcggga 

ctggacatca 

ctggaattcc 

atggttttca 

gggttgcctg 

aagtcccagg 

tttgctgtgg 

agagggcagc 

accctcagca 

ccctgtgagc 

agaggatcgc 

agagtgttcc 

tcgcagccct 

ctctgcccgc 

agggtcgacc 

cgggccaaag 

cgagtgggtg 

caggatgtgc 

ctgacgggca 

ggccaggacc 

gttgcgggcc 

gaggccgtgc 

tcggatcctc 

cagcggccag 

tcagtagggc 

tttgaggtga 

actgccttcg 

gccccctacc 

gtgatgaccg 

ggcgggagag 

tctgtcttgg 

ccccgggatt 

ctcattgagt 

tgcatgaatg 

ggctgggagg 

cagggatgga 

cgtacccctc 

aatgtctgtg 



tcctgttgct 
tccaggaagt 
tgtggtgctc 
aagggagctt 
gccccgagag 
ccttggattc 
aaggagggcg 
gaggcagaaa 
gggatgtggc 
gggtcaggtt 
acgtgctgtt 
gctcggccat 
acaggacgct 
ggcggaccct 
taacccaccc 
gccagaatgg 
tggcctttgg 
tcctcttcct 
tcttcgtgaa 
tggccacata 
ctgacctggt 
gtgccttgcg 
ggccacgtag 
cagcgcgtca 
gggcagagct 
aggatctgtt 
ggtgccggac 
ccgagaattt 
accctgacgt 
ggctggacac 
taggtggggt 
tccagagggg 
gcgcagagga 
tcgtgggcgt 
ccctgatcca 
ggctgtgtgg 
agggcagctg 
gcccccactg 
ttcttgagac 
ccagcaacta 
ccccaggtcc 



ggaggccgtc 
ccatgtaagc 
ggctgcagtg 
tgaaaggtcc 
ggtcagagtg 
attttcaacc 
cacggagacg 
tgcttctgtg 
actgccatcc 
tcccaggtgg 
ggctgagcag 
ctgctccagc 
ggagatggtc 
tgcggtgctg 
tgccacctgc 
aggcacatgt 
aggggaggct 
gctggacagc 
gcggtttgtg 
cagcagggag 
ctggagcctc 
gcaggcggca 
agtggtggtt 
cgcaagggcg 
ggaggagatc 
caaccaaatc 
acaagccctg 
tgctcagatg 
gacacaggtc 
caaacccacc 
gggctcagcc 
tgcccggcct 
tgcagccgtt 
ggggcctgtc 
cgtggcagct 
agaagccaag 
cgtcctgcag 
cgagaaccgt 
gcccctgagg 
cagagaaggc 
ttag 



tgtgttttcc 
aaagaaacca 
gacatcatgt 
aagcactttg 
ggagcattcc 
caacaggaag 
gaacttgctc 
ccccagatcc 
aagcagctga 
gaggagctgc 
gtggaggatg 
gccacgccag 
cgggagttcg 
gctgcacact 
tacaggacca 
gttccagaag 
aactgtgccc 
tctgcgggca 
cgggccgtgc 
ctgctggtgg 
gatggcattc 
gagcgtggct 
ttgctcactg 
cgagagctgc 
acaggcagcc 
cctgagctgc 
gacctcgtct 
cagagctttg 
ggcctggtgg 
cgggctgcga 
ggcaccgccc 
ggtgtcccca 
cctgcccaga 
ctaagtgagg 
tacgccgacc 
cagccagtca 
aatgggagct 
gagtggagct 
cacatggctc 
ctgggcactg 



tgttttccag 
tcgggaagat 
ttctgttaga 
ccatcacagt 
agttcagttc 
tgaaggcaag 
tgaaatacct 
tcatcatcgt 
aggaaagggg 
atgcactggc 
ccaccaacgg 
actgcagggt 
ctggcaatgc 
gtcccttcta 
cctgcccagg 
gactggacgg 
tgaagctgag 
ccactctgga 
tgagcgagga 
cggtgcctgt 
ccttccgtgg 
tcgggagcgc 
agtcacactc 
tcctgctggg 
caaagcatgt 
aggggaagct 
tcatgttgga 
tgagaagctg 
tgtatggcag 
tgctgcgggc 
tgctgcacat 
aagctgtggt 
agctgaggaa 
gtctgcggag 
tgcggtacca 
acctctgcaa 
accgctgcaa 
cttgctctgt 
ccgtgcagga 
aaatggtgcc 



agtgccccca 
ttcagctgcc 
tgggtctaac 
ctgtgacggt 
cactcctcat 
aatcaagagg 
tctgcacaga 
cactgatggg 
tgtcactgtg 
cagcgagcct 
cctcttcagc 
cgaggctcac 
cccatgctgg 
cagctggaag 
cccctgtgac 
ctaccagtgc 
cctggaatgc 
cggcttcctg 
ctctcgggcc 
gggggagtac 
tggccccacc 
caccaggaca 
cgaggatgag 
tgtaggcagt 
gatggtctac 
gtgcagccgg 
cacctctgcc 
tgccctccag 
ccaggtgcag 
cattagccag 
ctatgacaaa 
ggtgctcaca 
caatggcatc 
gcttgcaggt 
ccaggacgtg 
acccagcccg 
gtgtcgggat 
atgtgtgagc 
gggcagcagc 
taccttctgg 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2424 



<210> 23 

<211> 807 

<212> PRT 

<213> Homo sapiens 
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<400> 23 
Met Pro Pro Phe 
1 

Arg Val Pro Pro 
20 

Thr lie Gly Lys 
35 

Ala Val Asp lie 
50 

Gly Ser Phe Glu 
65 

Leu Asp lie Ser 



Ser Thr Pro His 
100 

Glu Val Lys Ala 
115 

Glu Thr Glu Leu 
130 

Gly Arg Asn Ala 
145 

Lys Ser Gin Gly 



Gly Val Thr Val 
180 

Leu His Ala Leu 
195 

Glu Gin Val Glu 

210 

Ser Ala lie Cys 
225 

Pro Cys Glu His 



Ala Pro Cys Trp 
260 

His Cys Pro Phe 
275 

Thr Cys Tyr Arg 
290 

Gin Asn Gly Gly 
305 



Leu Leu Leu Glu 
5 

Ser Leu Pro Leu 



lie Ser Ala Ala 
40 

Met Phe Leu Leu 
55 

Arg Ser Lys His 
70 

Pro Glu Arg Val 
85 

Leu Glu Phe Pro 



Arg lie Lys Arg 
120 

Ala Leu Lys Tyr 
135 

Ser Val Pro Gin 
150 

Asp Val Ala Leu 
165 

Phe Ala Val Gly 



Ala Ser Glu Pro 
200 

Asp Ala Thr Asn 
215 

Ser Ser Ala Thr 
230 

Arg Thr Leu Glu 
245 

Arg Gly Ser Arg 



Tyr Ser Trp Lys 
280 

Thr Thr Cys Pro 
295 

Thr Cys Val Pro 
310 



Ala Val Cys Val 
10 

Gin Glu Val His 
25 

Ser Lys Met Met 



Asp Gly Ser Asn 
60 

Phe Ala lie Thr 
75 

Arg Val Gly Ala 
90 

Leu Asp Ser Phe 
105 

Met Val Phe Lys 



Leu Leu His Arg 
140 

lie Leu lie lie 
155 

Pro Ser Lys Gin 
170 

Val Arg Phe Pro 
185 

Arg Gly Gin His 



Gly Leu Phe Ser 
220 

Pro Asp Cys Arg 
235 

Met Val Arg Glu 
250 

Arg Thr Leu Ala 
265 

Arg Val Phe Leu 



Gly Pro Cys Asp 
300 

Glu Gly Leu Asp 
315 



Phe Leu Phe Ser 
15 

Val Ser Lys Glu 
30 

Trp Cys Ser Ala 
45 

Ser Val Gly Lys 



Val Cys Asp Gly 
80 

Phe Gin Phe Ser 
95 

Ser Thr Gin Gin 
110 

Gly Gly Arg Thr 
125 

Gly Leu Pro Gly 



Val Thr Asp Gly 
160 

Leu Lys Glu Arg 
175 

Arg Trp Glu Glu 
190 

Val Leu Leu Ala 
205 

Thr Leu Ser Ser 



Val Glu Ala His 
240 

Phe Ala Gly Asn 
255 

Val Leu Ala Ala 
270 

Thr His Pro Ala 
285 

Ser Gin Pro Cys 



Gly Tyr Gin Cys 
320 
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Leu Cys Pro Leu 



Ser Leu Glu Cys 
340 

Gly Thr Thr Leu 
355 

Phe Val Arg Ala 
370 

Ala Thr Tyr Ser 
385 

Gin Asp Val Pro 



Gly Gly Pro Thr 
420 

Gly Phe Gly Ser 
435 

Val Val Leu Leu 
450 

Ala Arg His Ala 
465 

Glu Ala Val Arg 



Val Met Val Tyr 
500 

Leu Gin Gly Lys 
515 

Ala Leu Asp Leu 
530 

Glu Asn Phe Ala 
545 

Phe Glu Val Asn 



Ser Gin Val Gin 
580 

Ala Met Leu Arg 
595 

Ser Ala Gly Thr 
610 

Gin Arg Gly Ala 
625 



Ala Phe Gly Gly 
325 

Arg Val Asp Leu 



Asp Gly Phe Leu 
360 

Val Leu Ser Glu 
375 

Arg Glu Leu Leu 
390 

Asp Leu Val Trp 
405 

Leu Thr Gly Ser 



Ala Thr Arg Thr 
440 

Thr Glu Ser His 
455 

Arg Ala Arg Glu 
470 

Ala Glu Leu Glu 
485 

Ser Asp- Pro Gin 



Leu Cys Ser Arg 
520 

Val Phe Met Leu 
535 

Gin Met Gin Ser 
550 

Pro Asp Val Thr 
565 

Thr Ala Phe Gly 



Ala lie Ser Gin 
600 

Ala Leu Leu His 
615 

Arg Pro Gly Val 
630 



Glu Ala Asn Cys 
330 

Leu Phe Leu Leu 
345 

Arg Ala Lys Val 



Asp Ser Arg Ala 
380 

Val Ala Val Pro 
395 

Ser Leu Asp Gly 
410 

Ala Leu Arg Gin 
425 

Gly Gin Asp Arg 



Ser Glu Asp Glu 
460 

Leu Leu Leu Leu 
475 

Glu lie Thr Gly 
490 

Asp Leu Phe Asn 
505 

Gin Arg Pro Gly 



Asp Thr Ser Ala 
540 

Phe Val Arg Ser 
555 

Gin Val Gly Leu 
570 

Leu Asp Thr Lys 
585 

Ala Pro Tyr Leu 



lie Tyr Asp Lys 
620 

Pro Lys Ala Val 
635 



Ala Leu Lys Leu 
335 

Asp Ser Ser Ala 
350 

Phe Val Lys Arg 
365 

Arg Val Gly Val 



Val Gly Glu Tyr 
400 

lie Pro Phe Arg 
415 

Ala Ala Glu Arg 
430 

Pro Arg Arg Val 
445 

Val Ala Gly Pro 



Gly Val Gly Ser 
480 

Ser Pro Lys His 
495 

Gin lie Pro Glu 
510 

Cys Arg Thr Gin 
525 

Ser Val Gly Pro 



Cys Ala Leu Gin 
560 

Val Val Tyr Gly 
575 

Pro Thr Arg Ala 
590 

Gly Gly Val Gly 
605 

Val Met Thr Val 



Val Val Leu Thr 
640 
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Gly Gly Arg Gly 



Asn Asn Gly lie 
660 

Glu Gly Leu Arg 
675 

Ala Ala Tyr Ala 
690 

Leu Cys Gly Glu 
705 

Cys Met Asn Glu 



Lys Cys Arg Asp 
740 

Ser Ser Cys Ser 
755 

Leu Arg His Met 
770 

Ser Asn Tyr Arg 
785 



Ala Glu Asp Ala 
645 

Ser Val Leu Val 



Arg Leu Ala Gly 
680 

Asp Leu Arg Tyr 
695 

Ala Lys Gin Pro 
710 

Gly Ser Cys Val 
725 

Gly Trp Glu Gly 



Val Cys Val Ser 
760 

Ala Pro Val Gin 
775 

Glu Gly Leu Gly 
790 



Ala Val Pro Ala 
650 

Val Gly Val Gly 
665 

Pro Arg Asp Ser 



His Gin Asp Val 
700 

Val Asn Leu Cys 

715 

Leu Gin Asn Gly 
730 

Pro His Cys Glu 
745 

Gin Gly Trp lie 



Glu Gly Ser Ser 
780 

Thr Glu Met Val 
795 



Gin Lys Leu Arg 
655 

Pro Val Leu Ser 
670 

Leu lie His Val 
685 

Leu lie Glu Trp 



Lys Pro Ser Pro 
720 

Ser Tyr Arg Cys 
735 

Asn Arg Glu Trp 
750 

Leu Glu Thr Pro 
765 

Arg Thr Pro Pro 



Pro Thr Phe Trp 
800 



Asn Val Cys Ala Pro Gly Pro 
805 



<210> 24 

<211> 1780 

<212> DNA 

<213> Homo sapiens 



<400> 24 

aggtcggctg 

ctaggactca 

aagtgcgcgg 

ttgatggata 

gcaaatcgaa 

tcacctaaag 

cacgcaagta 

tttttgaaag 

agccaacagg 

tagataagaa 

atgaaaaaaa 

tttttaacta 

tgaatgttgg 

atgatgatgg 

tctggagtta 

ggaatgaatg 

gtgtcaaatt 

attggttgaa 

atccatatta 

tcagtttcat 

atctaaagtg 



gttatcggga 
cgcaccaggc 
ttggggccac 
aatcaggaat 
gtgataactc 
gggagaaaag 
ttcatcaagt 
attcaagaac 
aagaccacgg 
gaaacaattt 
cgcaccttgg 
tattgaaaaa 
tgaagattta 
atccatttct 
aaggtattgg 
tgatttggac 
gagaaccagg 
gaaattagat 
aaacgagatg 
taaaaagggc 
accttgatgg 



gttggagggc 
cagtcgcggg 
tgcgaggccg 
agattctctt 
ttctgatagc 
aaaccccatt 
gactcatctt 
aggaaaaaga 
ggaagaccag 
agaagcagag 
agaaaaattt 
ctgaagtatg 
gaaaatgaag 
cctattgagg 
catactccac 
tcattcatgt 
tagatcccca 
cccaaagatt 
acagaagcca 
aaacttgaag 
acagtggaag 



tgaggtcggg 
ttttgggccg 
ttttagaaaa 
gaccatgtga 
agcttattta 
cgaaaatttg 
ttgaaccagt 
gatataaaaa 
aaggaaggag 
gatctggctt 
taacgtttga 
aacaccacct 
attttgacag 
agtcaacgct 
acatctgtac 
atgagagtaa 
ccacctacag 
cttggtgaat 
aagtaattat 
ataaatcttt 
aaatcacaac 



agggtggtgt 
aggcctgggt 
ctgtttaaaa 
catctgatgc 
aaactcagtg 
ttcgtacacc 
accattgact 
aaagaaaaag 
aaatcctata 
cccattttta 
gcaagctgtt 
gaaagaatca 
tcgtagatac 
tttatcttga 
cattcttgag 
gcaatgcttt 
taaaaaggac 
tttgaagtct 
gggctgacag 
tgactccagc 
atggaattcc 



gtacagagct 
tacaagcagc 
caaagagcaa 
tgtggaactt 
tatcccttac 
tgaaagtgtt 
ataaaagcta 
aggaggtacc 
tactcactaa 
gaatcagaga 
gcaagaggat 
ttgaagcaaa 
aaat t tt tgg 
ggacatggtg 
tgatcgctta 
tttttccagg 
cctaaagtaa 
tcatcagtat 
gacaactgga 
tctttagagg 
tcgaataaca 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 
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atttattgac 
actatttcta 
aaggaataaa 
acaatctgtt 
cgttgtggcg 
tggtgtctcc 
gaggactctt 
ctgtagacac 
caacaagggg 



tttaaataat 
gaaagtcagc 
aatttgaggt 
gtgcggcgcc 
gcccgggggc 
ccaagagaga 
cacccaaagt 
aacccccaca 
tgcccaacag 



tttgtctaat 
atgtattttt 
ttcaatacaa 
cctgggcccc 
cccacagttg 
gaacacctcc 
atataaaacc 
gtgggaacct 
aaaccccgag 



gctacatata 
ggctcgaagt 
aaacaaaaca 
ttgagagaaa 
ggtttaggtg 
ggggtcaagc 
cgccccgcgg 
ctgagggcgc 
ttaaaaatcg 



cacaattaaa 
ttctctagtg 
aacaacacga 
actttttaga 
ggcacccttg 
ggacaacaag 
gggaaccacc 
acacacaggg 



aaacctttac 
ttttctgtgg 
aacacgaaaa 
accccttttg 
tgtctacaag 
agtgcgtcgt 
ggccgctttt 
cgagccttat 



1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1780 



<210> 25 

<211> 1646 

<212> DNA 

<213> Homo sapiens 

<220> 

<223> MGC5306 



<400> 25 

ggtcggctgg 

taggactcac 

agtgcgcggt 

tgatggataa 

caaatcgaag 

cacctaaagg 

acgcaagtga 

tttttgaaag 

agccaacagg 

tagataagaa 

atgaaaaaaa 

tttttaacta 

tgaatgttgg 

atgatgatgg 

atcttgaaga 

aattccctgt 

tgtctaaaaa 

aggatcatga 

ttgttagatc 

ctaaacgtta 

acatctgtac 

tgagagggtg 

aaagtaaatt 

tcagtatatc 

aactggatca 

ttagaggatc 

aataacaatt 

cctttacact 



ttatcgggag 
gcaccaggcc 
tggggccact 
atcaggaata 
tgataactct 
ggagaaaaga 
ttcatcaagt 
attcaagaac 
aagaccacgg 
gaaacaattt 
cgcaccttgg 
tattgaaaaa 
tgaagattta 
atccatttct 
taacgaatgt 
aagactgagt 
gagagctaca 
atgtcaaagg 
tgtgggaagg 
tgaaaaacca 
cattcttgag 
tcaaattgag 
ggttgaagaa 
catattaaaa 
gtttcattaa 
taaagtgacc 
tattgacttt 
aaaaaaaaaa 



ttggagggct 
agtcgcgggt 
gcgaggccgt 
gattctcttg 
tctgatagca 
aaccccattc 
gactcatctt 
aggaaaaaga 
ggaagaccag 
agaagcagag 
agaaaaattt 
ctgaagtatg 
gaaaatgaag 
cctattgagg 
gatatcaaat 
gtatacttag 
aaagc.caaaa 
cttttatctt 
aattacaaga 
gataatctac 
tgatcgctta 
aaccaggtag 
attagatccc 
cgagatgaca 
aaagggcaaa 
ttgatggaca 
aaataatttt 
aaaaaa 



gaggtcggga 
tttgggccga 
tttagaaaac 
accatgtgac 
gcttatttaa 
gaaaatttgt 
ttgaaccaat 
gatataaaaa 
aaggaaggag 
gatctggctt 
taacgtttga 
aacaccacct 
attttgacag 
agtcaacagc 
tggcagggga 
aagaagagga 
atactggaca 
gagaacatgg 
cagttgctaa 
ttttttacct 
ggaatgaatg 
atccccacca 
aaagattctt 
gaagccaaag 
cttgaagata 
gtggaagaaa 
gtctaatgct 



gggtggtgtg 
ggcctgggtt 
tgtttaaaac 
atctgatgct 
aactcagtgt 
tcgtacacct 
accattgact 
aaagaaaaag 
aaatcctata 
cccattttta 
gcaagctgtt 
gaaagaatca 
tcgtagatac 
agaggatgag 
tagtttcata 
tattactgaa 
gagaggcctg 
tgtctggagt 
aagtttgaaa 
taggtattgg 
tgatttgaac 
cctacagtaa 
ggtgaatttt 
taattatggg 
aatcttttga 
tcacaacatg 
acatatacac 



tacagagctc 
acaagcagca 
aaagagcaat 
gtggaacttg 
atcccttact 
gaaagtgttc 
ataaaagcta 
aggaggtacc 
tactcactaa 
gaatcagaga 
gcaagaggat 
ttgaagcaaa 
aaatttttgg 
gatgcaacac 
gtaagttctg 
gaagctgctt 
aaaatgtgac 
taaaggacta 
aagacggttg 
catactccac 
tcattcatgt 
aaaggaccct 
gaagtcttca 
ctgacaggac 
ctccagctct 
gaattcctcg 
aattaaaaaa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1646 



<210> 26 
<211> 278 
<212> PRT 

<213> Homo sapiens 
<220> 

<223> MGC5306 
<400> 26 

Met Asp Lys Ser Gly lie Asp Ser Leu Asp His Val Thr Ser Asp Ala 
15 10 15 



28 



Val Glu Leu Ala Asn Arg Ser Asp Asn Ser Ser Asp Ser Ser Leu Phe 
20 25 30 



Lys Thr Gin Cys lie Pro Tyr Ser Pro Lys Gly Glu Lys Arg Asn Pro 
35 40 45 

lie Arg Lys Phe Val Arg Thr Pro Glu Ser Val His Ala Ser Asp Ser 
50 55 60 

Ser Ser Asp Ser Ser Phe Glu Pro lie Pro Leu Thr lie Lys Ala lie 
65 70 75 80 

Phe Glu Arg Phe Lys Asn Arg Lys Lys Arg Tyr Lys Lys Lys Lys Lys 
85 90 95 

Arg Arg Tyr Gin Pro Thr Gly Arg Pro Arg Gly Arg Pro Glu Gly Arg 
100 105 110 

Arg Asn Pro lie Tyr Ser Leu lie 'Asp Lys Lys Lys Gin Phe Arg Ser 
115 120 125 

Arg Gly Ser Gly Phe Pro Phe Leu Glu Ser Glu Asn Glu Lys Asn Ala 
130 135 140 

Pro Trp Arg Lys lie Leu Thr Phe Glu Gin Ala Val Ala Arg Gly Phe 
145 150 155 160 

Phe Asn Tyr lie Glu Lys Leu Lys Tyr Glu His His Leu Lys Glu Ser 
165 170 175 

Leu Lys Gin Met Asn Val Gly Glu Asp Leu Glu Asn Glu Asp Phe Asp 
180 185 190 

Ser Arg Arg Tyr Lys Phe Leu Asp Asp Asp Gly Ser lie Ser Pro lie 
195 200 205 

Glu Glu Ser Thr Ala Glu Asp Glu Asp Ala Thr His Leu Glu Asp Asn 
210 215 220 

Glu Cys Asp lie Lys Leu Ala Gly Asp Ser Phe lie Val Ser Ser Glu 
225 230 235 240 

Phe Pro Val Arg Leu Ser Val Tyr Leu Glu Glu Glu Asp lie Thr Glu 
245 250 255 

Glu Ala Ala Leu Ser Lys Lys Arg Ala Thr Lys Ala Lys Asn Thr Gly 
260 265 270 

Gin Arg Gly Leu Lys Met 
275 



<210> 27 

<211> 2336 

<212> DNA 

<213> Homo sapiens 

<220> 

<223> FLJ23363, clone HEP15507 
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<400> 27 

cttttcttcc gcacggttgg aggaggtcgg ctggttatcg ggagttggag ggctgaggtc 60 
gggagggtgg tgtgtacaga gctctaggac tcacgcacca ggccagtcgc ggattttggg 120 
ccgaggcctg ggttacaagc agcaagtgcg cggttggggc cactgcgagg ccgttttaga 180 
aaactgttta aaacaaagag caattgatgg ataaatcagg aatagattct cttgaccatg 240 
tgacatctga tgctgtggaa cttgcaaatc gaagtgataa ctcttctgat agcagcttat 300 
ttaaaactca gtgtatccct tactcaccta aaggggagaa aagaaacccc attcgaaaat 360 
ttgttcgtac acctgaaagt gttcacgcaa gtgattcatc aagtgactca tcttttgaac 420 
caataccatt gactataaaa gctatttttg aaagattcaa gaacaggaaa aagagatata 480 
aaaaaaagaa aaagaggagg taccagccaa caggaagacc acggggaaga ccagaaggaa 54 0 
ggagaaatcc tatatactca ctaatagata agaagaaaca atttagaagc agaggatctg 600 
gcttcccatt tttagaatca gagaatgaaa aaaacgcacc ttggagaaaa attttaacgt 660 
ttgagcaagc tgttgcaaga ggatttttta actatattga aaagctgaag tatgaacacc 720 
acctgaaaga atcattgaag caaatgaatg ttggtgaaga tttagaaaat gaagattttg 7 80 
acagtcgtag atacaaattt ttggatgatg atggatccat ttctcctatt gaggagtcaa 84 0 
cagcagagga tgaggatgca acacatcttg aagataacga atgtgatatc aaattggcag 900 
gggatagttt catagtaagt tctgaattcc ctgtaagact gagtgtatac ttagaagaag 960 
aggatattac tgaagaagct gctttgtcta aaaagagagc tacaaaagcc aaaaatactg 102 0 
gacagagagg cctgaaaatg tgacaggatc atgaatgtca aaggctttta tcttgagaac 1080 
atggtgtctg gagttaaagg tattggcata ctccacacat ctgtaccatt cttgagtgat 1140 
cgcttaggaa tgaatgtgat ttgaactcat tcatgttgag agggtgtcaa attgagaacc 1200 
aggtagatcc ccaccaccta cagtaaaaag gaccctaaag taaattggtt gaagaaatta 1260 
gatcccaaag attcttggtg aattttgaag tcttcatcag tatatccata ttaaaacgag 1320 
atgacagaag ccaaagtaat tatggcaagt aatggttttt atcttaacta taagttattt 1380 
gctcaagggt gtaatggtca ttaccaaggc ttttagaatg cagtttctca tttgctgtgg 144 0 
acatgaccat aaaaaaaaat ttcccagtag gttttctatc tgctacgttg ctagcaatca 1500 
gcttattggg aacagttgat taactgtaat agaaatgcaa tacaaataaa atgtgaacca 1560 
catgtgattt ttctttaaaa tcagtgagat ttgaaaattc tcctagatct cttgaatcat 1620 
gcaaatttgc tttgccttta tattgtaacc cttgtgggtt gctaataacc aagcagtttg 168 0 
tagtagagtt aactcaggct cgttctaggg actcattcat gttcactcac tgtacactca 174 0 
tctctggaaa tgtaaaattt acttttatac tattgttatg tagggctgac aggacaactg 1800 
gatcagtttc attaaaaagg tatgtatgca ttagaaaaga catttgtatg ggtcatttca 1860 
aagagggctt atgaggctgt gaaacccaga gctcttaacg ctgtgaccaa agatggaagt 192 0 
tctctatagg aagccatagc actcctaatg tttggtgcta tgttttcctg aggagatata 1980 
aaacgtaata atccatgatt gttgccatgt gagagtttta aaggttaatc aaaatttctc 2 04 0 
ttcttcaggg caaacttgaa gataaatctt ttgactccag ctctttagag gatctaaagt 2100 
gaccttgatg gacagtggaa gaaatcacaa catggaattc ctcgaataac aatttattga 2160 
ctttaaataa ttttgtctaa tgctacatat acacaattaa aaaaccttta cactatttct 2220 
agaaagtcag catgtatttt tggctcgaag tttctctagt gttttctgtg gaaggaataa 2280 
aaatttgagt ttcaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaa 23 36 



<210> 28 
<211> 278 
<212> PRT 

<213> Homo sapiens 

<220> 

<223> FLJ23363, clone HEP15507, MGC5306 
<400> 28 

Met Asp Lys Ser Gly lie Asp Ser Leu Asp His Val Thr Ser Asp Ala 
15 10 15 

Val Glu Leu Ala Asn Arg Ser Asp Asn Ser Ser Asp Ser Ser Leu Phe 
20 25 30 

Lys Thr Gin Cys lie Pro Tyr Ser Pro Lys Gly Glu Lys Arg Asn Pro 
35 40 45 
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lie Arg Lys Phe Val Arg Thr Pro Glu Ser Val His Ala Ser Asp Ser 
50 55 60 

Ser Ser Asp Ser Ser Phe Glu Pro lie Pro Leu Thr lie Lys Ala lie 
65 70 75 80 



Phe Glu Arg Phe Lys Asn Arg Lys Lys Arg Tyr Lys Lys Lys Lys Lys 
85 90 95 

Arg Arg Tyr Gin Pro Thr Gly Arg Pro Arg Gly Arg Pro Glu Gly Arg 
100 105 110 

Arg Asn Pro lie Tyr Ser Leu lie Asp Lys Lys Lys Gin Phe Arg Ser 

115 120 125 

Arg Gly Ser Gly Phe Pro Phe Leu Glu Ser Glu Asn Glu Lys Asn Ala 
130 135 140 

Pro Trp Arg Lys lie Leu Thr Phe Glu Gin Ala Val Ala Arg Gly Phe 
145 150 155 160 

Phe Asn Tyr lie Glu Lys Leu Lys Tyr Glu His His Leu Lys Glu Ser 
165 170 175 

Leu Lys Gin Met Asn Val Gly Glu Asp Leu Glu Asn Glu Asp Phe Asp 
180 185 190 

Ser Arg Arg Tyr Lys Phe Leu Asp Asp Asp Gly Ser lie Ser Pro lie 

195 200 205 

Glu Glu Ser Thr Ala Glu Asp Glu Asp Ala Thr His Leu Glu Asp Asn 
210 215 220 

Glu Cys Asp lie Lys Leu Ala Gly Asp Ser Phe lie Val Ser Ser Glu 
225 230 235 240 

Phe Pro Val Arg Leu Ser Val Tyr Leu Glu Glu Glu Asp lie Thr Glu 
245 ^ 250 255 

Glu Ala Ala Leu Ser Lys Lys Arg Ala Thr Lys Ala Lys Asn Thr Gly 
260 265 270 



Gin Arg Gly Leu Lys Met 
275 



<210> 29 

<211> 2174 

<212> DNA 

<213> Homo sapiens 



<220> 

<223> nuclear factor (erythroid-derived 2) -like 3 
(NFE2L3) 



<400> 29 

gccgccgcct cgtccaccgg aggagccggc 
caggggggcg gcggggaccc ccgagcggct 
gagaaggcac ccgcggaacc gacggctcag 
gagaatgggg tactaagaga aaagcacgaa 
aatgaagaaa gggtgtcagc ccagaaggag 



gccagcgtgg acggcggcag ccaggctgtg 60 
cggagtggcc ccttggacgc cggggaagag 12 0 
gtgccggacg ctggcggatg tgcgagcgag 180 
gctgtggatc atagttccca gcatgaggaa 24 0 
aactcacttc agcagaatga tgatgatgaa 3 00 
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aacaaaatag 
agacatctga 
tcacagcctg 
atcagtgatg 
caggatgtga 
ccaacagcaa 
aatcctgagc 
catatgagga 
gagataaact 
ctttttgatg 
tctgtcatca 
actgaccatg 
gaacccagta 
tttcaacacg 
tctgaacctt 
acagatagaa 
gtagatgaaa 
ctgacagacc 
gctgcgcaga 
tgtaacttgc 
attaacataa 
gatgaccaag 
agtatcttga 
ggaaagagaa 
gaaactgatt 
ctgctacttg 
cttcaagatc 
atacaaaatt 
catcttggca 
ggatcaaatt 
agtttgttag 
aaattttgct 



cagagaaacc 
atgggacaga 
aaaattcact 
gcatgaattc 
atcttcatga 
ggacttcaca 
aaacccttcc 
atctaacaag 
taatgtcatt 
aaccagattc 
agtctaattc 
aatctagttc 
agctttgtca 
tatttcataa 
ttccgtggcc 
acttgagccg 
ttgtcggcat 
tacaagtctc 
actgtcgtaa 
aagcaaagaa 
tgaaacagaa 
gtaggccagt 
tagtacccaa 
agtgagaaga 
atttggatca 
aataactcag 
acacttgtgg 
catagttatg 
gccatccttt 
attttaagag 
cactatagtg 
ttct 



tgactgggag 
tacttctttc 
ggagggcatc 
ttcagcacat 
ggccatcttg 
gtcacaagaa 
tggaactaat 
ccaagaccta 
ggccacagaa 
tgattctggc 
ctctcactct 
ccatcatgac 
cttggatcaa 
ccacacttac 
tgggaagtca 
tgatgaacag 
gcctgttgat 
acttatccgt 
acgcaaattg 
ggaaactctt 
actgcatgac 
caatcccaac 
agaactggtg 
aactgaagat 
gaaaccattg 
ttaacgctgt 
gcaatctggg 
tccaaagaat 
ttaagagtaa 
gtatttcagt 
agcttttcaa 



gcagaaaaga 
tctctggaag 
tcattgggag 
tatcatgtaa 
ctttgtccca 
ccatttctgc 
ttgacaggat 
ctgtatgacc 
gacaactttg 
ctttctttag 
gtgtgtgatg 
ttagaaggtg 
agtgattctg 
cacttacagc 
cagaagataa 
cgtgctaaag 
tctttcaata 
gacatcagac 
gacataattt 
aagagagagc 
ctttatcatg 
cactatgctc 
gcctcaggcc 
ggactctatt 
aaactgcttc 
tttgaagctt 
ggagccacaa 
aggttaacat 
gttggttact 
tttaaatgca 
acactatttt 



ccactgaatc 
acttattcca 
atattcctct 
acttcagcca 
acaatacatt 
agttaaattc 
ttctttcacc 
ttgacataaa 
atccaatcga 
attcaagtca 
aaggtgctat 
ctgtaggtgg 
atttccatgg 
caactgcacc 
ggagtagata 
ctttgcatat 
gcatgttaag 
gaagagggaa 
tgaatttaga 
aagcacaatg 
atatttttag 
tccagtgtac 
acaaaaagga 
atgtgaagta 
aagaattgta 
acatggacaa 
cttttcatga 
gaaaacccag 
tcaaaaagag 
aaatagcctt 
aatctttata 



tagaaatgag 
gttgctttca 
tccaggcagt 
ggctataagt 
tagaagagat 
tcataccacc 
ggttgacaat 
tatatttgat 
tgtttctcag 
caataatacc 
aggttattgc 
ctactaccca 
agatcttaca 
agaatctact 
ccttgaagac 
ccctttttct 
tagatattat 
aaataaagtt 
agatgatgta 
taacaaagct 
tagattaaga 
ccatgatgga 
aacccaaaag 
gtaatgttca 
tctttaagta 
atgtttagga 
agtgcattgt 
taagactttc 
caaacactgg 
attttcattt 
tttaacttat 



360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2174 



<210> 30 
<211> 400 
<212> PRT 

<213> Homo sapiens 
<220> 

<223> nuclear factor (erythroid- derived 2) -like 3 
(NFE2L3) 

<400> 30 

Met Asn Ser Ser Ala His Tyr His Val Asn Phe Ser Gin Ala lie Ser 
15 10 15 

Gin Asp Val Asn Leu His Glu Ala lie Leu Leu Cys Pro Asn Asn Thr 
20 25 30 

Phe Arg Arg Asp Pro Thr Ala Arg Thr Ser Gin Ser Gin Glu Pro Phe 
35 40 45 

Leu Gin Leu Asn Ser His Thr Thr Asn Pro Glu Gin Thr Leu Pro Gly 
50 55 60 

Thr Asn Leu Thr Gly Phe Leu Ser Pro Val Asp Asn His Met Arg Asn 
65 70 75 80 

Leu Thr Ser Gin Asp Leu Leu Tyr Asp Leu Asp lie Asn lie Phe Asp 
85 90 95 
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Glu lie Asn Leu Met Ser Leu Ala Thr Glu Asp Asn Phe Asp Pro lie 
100 105 110 



Asp Val Ser Gin Leu Phe Asp Glu Pro Asp Ser Asp Ser Gly Leu Ser 
115 120 125 

Leu Asp Ser Ser His Asn Asn Thr Ser Val lie Lys Ser Asn Ser Ser 
130 135 140 

His Ser Val Cys Asp Glu Gly Ala lie Gly Tyr Cys Thr Asp His Glu 
145 150 155 160 

Ser Ser Ser His His Asp Leu Glu Gly Ala Val Gly Gly Tyr Tyr Pro 
165 170 175 

Glu Pro Ser Lys Leu Cys His Leu Asp Gin Ser Asp Ser Asp Phe His 
180 185 190 

Gly Asp Leu Thr Phe Gin His Val Phe His Asn His Thr Tyr His Leu 
195 200 205 

Gin Pro Thr Ala Pro Glu Ser Thr Ser Glu Pro Phe Pro Trp Pro Gly 
210 215 220 

Lys Ser Gin Lys lie Arg Ser Arg Tyr Leu Glu Asp Thr Asp Arg Asn 
225 230 235 240 

Leu Ser Arg Asp Glu Gin Arg Ala Lys Ala Leu His lie Pro Phe Ser 
245 250 255 

Val Asp Glu lie Val Gly Met Pro Val Asp Ser Phe Asn Ser Met Leu 
260 265 270 

Ser Arg Tyr Tyr Leu Thr Asp Leu Gin Val Ser Leu lie Arg Asp lie 
275 280 285 

Arg Arg Arg Gly Lys Asn Lys Val Ala Ala Gin Asn Cys Arg Lys Arg 
290 295 300 

Lys Leu Asp lie lie Leu Asn Leu Glu Asp Asp Val Cys Asn Leu Gin 
305 310 315 320 

Ala Lys Lys Glu Thr Leu Lys Arg Glu Gin Ala Gin Cys Asn Lys Ala 
325 330 335 

lie Asn He Met Lys Gin Lys Leu His Asp Leu Tyr His Asp He Phe 
340 345 350 

Ser Arg Leu Arg Asp Asp Gin Gly Arg Pro Val Asn Pro Asn His Tyr 
355 360 365 

Ala Leu Gin Cys Thr His Asp Gly Ser He Leu He Val Pro Lys Glu 
370 375 380 

Leu Val Ala Ser Gly His Lys Lys Glu Thr Gin Lys Gly Lys Arg Lys 
385 390 395 400 
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<210> 31 
<211> 1209 

<212> DNA 

<213> Homo sapiens 
<220> 

<223> winged helix/f orkhead transcription factor (HFHl) 
<220> 

<221> modif ied_base 

<222> (1161) 

<223> n = g, a, c or t 

<400> 31 

atgaagttgg aggtgttcgt ccctcgcgcg gcccacgggg acaagcaggg cagtgacctg 60 
gagggcgcgg gcggcagcga cgcgccgtcc ccgctgtcgg cggcgggaga cgactccctg 12 0 
ggctcagatg gggactgcgc ggccaagccg tccgcgggcg gcggcgccag agatacgcag 180 
ggcgacggcg aacagagtgc gggaggcggg ccgggcgcgg aggaggcgat cccggcagca 24 0 
gctgctgcag cggtggtggc ggagggcgcg gaggccgggg cggcggggcc aggcgcgggc 3 00 
ggcgcgggga gcggcgaggg tgcacgcagc aagccatata cgcggcggcc caagcccccc 3 60 
tactcgtaca tcgcgctcat cgccatggcc atccgcgact cggcgggcgg gcgcttgacg 420 
ctggcggaga tcaacgagta cctcatgggc aagttcccct ttttccgcgg cagctacacg 480 
ggctggcgca actccgtgcg ccacaacctt tcgctcaacg actgcttcgt caaggtgctg 540 
cgcgacccct cgcggccctg gggcaaggac aactactgga tgctcaaccc caacagcgag 600 
tacaccttcg ccgacggggt cttccgccgc cgccgcaagc gcctcagcca ccgcgcgccg 660 
gtccccgcgc ccgggctgcg gcccgaggag gccccgggcc tccccgccgc cccgccgccc 720 
gcgcccgccg ccccggcctc gccccgcatg cgctcgcccg cccgccagga ggagcgcgcc 780 
agccccgcgg gcaagttctc cagctccttc gccatcgaca gcatcctgcg caagcccttc 840 
cgcagccgtc gcctcaggga cacggccccc gggacgacgc ttcagtgggg cgccgcgccc 900 
tgcccgccgc tgcccgcgtt ccccgcgctc ctccccgcgg cgccctgcag ggccctgctg 96 0 
ccgctctgcg cgtacggcgc gggcgagccg gcgcggctgg gcgcgcgcga ggccgaggtg 1020 
ccaccgaccg cgccgcccct cctgcttgca cctctcccgg cggcggcccc cgccaagcca 1080 
ctccgaggcc cggcggccgg cggcgcgcac ctgtactgcc ccctgcggct gcccgcagcc 1140 
ctgcaggcgg ccttagtccg ncgtcctggc ccgcacctgt cgtacccggt ggagacgctc 12 00 
ctagcttga 1209 



<210> 32 
<211> 402 
<212> PRT 

<213> Homo sapiens 
<220> 

<223> winged helix/f orkhead transcription factor (HFHl) 
<400> 32 

Met Lys Leu Glu Val Phe Val Pro Arg Ala Ala His Gly Asp Lys Gin 
1.5 10 15 

Gly Ser Asp Leu Glu Gly Ala Gly Gly Ser Asp Ala Pro Ser Pro Leu 
20 25 30 

Ser Ala Ala Gly Asp Asp Ser Leu Gly Ser Asp Gly Asp Cys Ala Ala 
35 40 45 

Lys Pro Ser Ala Gly Gly Gly Ala Arg Asp Thr Gin Gly Asp Gly Glu 
50 55 60 

Gin Ser Ala Gly Gly Gly Pro Gly Ala Glu Glu Ala lie Pro Ala Ala 
65 70 75 80 
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Ala Ala Ala Ala 



Pro Gly Ala Gly 
100 

Tyr Thr Arg Arg 
115 

Met Ala lie Arg 
130 

Asn Glu Tyr Leu 
145 

Gly Trp Arg Asn 



Val Lys Val Leu 
180 

Trp Met Leu Asn 
195 

Arg Arg Arg Arg 
210 

Gly Leu Arg Pro 
225 

Ala Pro Ala Ala 



Glu Glu Arg Ala 
260 

Asp Ser lie Leu 
275 

Ala Pro Gly Thr 
290 

Pro Ala Phe Pro 
305 

Pro Leu Cys Ala 



Glu Ala Glu Val 
340 

Pro Ala Ala Ala 
355 

Ala His Leu Tyr 
370 

Leu Val Arg Arg 
385 

Leu Ala 



Val Val Ala Glu 
85 

Gly Ala Gly Ser 



Pro Lys Pro Pro 
120 

Asp Ser Ala Gly 
^ 135 

Met Gly Lys Phe 
150 

Ser Val Arg His 
165 

Arg Asp Pro Ser 



Pro Asn Ser Glu 
200 

Lys Arg Leu Ser 
215 

Glu Glu Ala Pro 
230 

Pro Ala Ser Pro 
245 

Ser Pro Ala Gly 



Arg Lys Pro Phe 
280 

Thr Leu Gin Trp 
295 

Ala Leu Leu Pro 
310 

Tyr Gly Ala Gly 
325 

Pro Pro Thr Ala 



Pro Ala Lys Pro 
360 

Cys Pro Leu Arg 
375 

Pro Gly Pro His 
390 



Gly Ala Glu Ala 
90 

Gly Glu Gly Ala 
105 

Tyr Ser Tyr lie 



Gly Arg Leu Thr 
140 

Pro Phe Phe Arg 

155 

Asn Leu Ser Leu 
170 

Arg Pro Trp Gly 
185 

Tyr Thr Phe Ala 



His Arg Ala Pro 
220 

Gly Leu Pro Ala 

235 

Arg Met Arg Ser 
250 

Lys Phe Ser Ser 
265 

Arg Ser Arg Arg 



Gly Ala Ala Pro 
300 

Ala Ala Pro Cys 

315 

Glu Pro Ala Arg 
330 

Pro Pro Leu Leu 
345 

Leu Arg Gly Pro 



Leu Pro Ala Ala 
380 

Leu Ser Tyr Pro 
395 



Gly Ala Ala Gly 
95 

Arg Ser Lys Pro 
110 

Ala Leu lie Ala 
125 

Leu Ala Glu lie 



Gly Ser Tyr Thr 

160 

Asn Asp Cys Phe 
175 

Lys Asp Asn Tyr 
190 

Asp Gly Val Phe 
205 

Val Pro Ala Pro 



Ala Pro Pro Pro 
240 

Pro Ala Arg Gin 
255 

Ser Phe Ala lie 
270 

Leu Arg Asp Thr 
285 

Cys Pro Pro Leu 



Arg Ala Leu Leu 
320 

Leu Gly Ala Arg 
335 

Leu Ala Pro Leu 
350 

Ala Ala Gly Gly 
365 

Leu Gin Ala Ala 



Val Glu Thr Leu 
400 
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<210> 33 

<211> 3218 

<212> DNA 

<213> Homo sapiens 

<220> 

<223> chloride channel, calcium activatd, family member 
4 (CLCA4) 

<400> 33 

gaacaaacca acatttgagc caggaataac tagagaggaa caatggggtt attcagaggt 60 
tttgttttcc tcttagttct gtgcctgctg caccagtcaa atacttcctt cattaagctg 120 
aataataatg gctttgaaga tattgtcatt gttatagatc ctagtgtgcc agaagatgaa 180 
aaaataattg aacaaataga ggatatggtg actacagctt ctacgtacct gtttgaagcc 240 
acagaaaaaa gatttttttt caaaaatgta tctatattaa ttcctgagaa ttggaaggaa 3 00 
aatcctcagt acaaaaggcc aaaacatgaa aaccataaac atgctgatgt tatagttgca 360 
ccacctacac tcccaggtag agatgaacca tacaccaagc agttcacaga atgtggagag 420 
aaaggcgaat acattcactt cacccctgac cttctacttg gaaaaaaaca aaatgaatat 480 
ggaccaccag gcaaactgtt tgtccatgag tgggctcacc tccggtgggg agtgtttgat 54 0 
gagtacaatg aagatcagcc tttctaccgt gctaagtcaa aaaaaatcga agcaacaagg 6 00 
tgttccgcag gtatctctgg tagaaataga gtttataagt gtcaaggagg cagctgtctt 660 
agtagagcat gcagaattga ttctacaaca aaactgtatg gaaaagattg tcaattcttt 720 
cctgataaag tacaaacaga aaaagcatcc ataatgttta tgcaaagtat tgattctgtt 780 
gttgaatttt gtaacgaaaa aacccataat caagaagctc caagcctaca aaacataaag 84 0 
tgcaatttta gaagtacatg ggaggtgatt agcaattctg aggattttaa aaacaccata 900 
cccatggtga caccacctcc tccacctgtc ttctcattgc tgaagatccg tcaaagaatt 960 
gtgtgcttag ttcttgataa gtctggaagc atggggggta aggaccgcct aaatcgaatg 102 0 
aatcaagcag caaaacattt cctgctgcag actgttgaaa atggatcctg ggtggggatg 1080 
gttcactttg atagtactgc cactattgta aataagctaa tccaaataaa aagcagtgat 1140 
gaaagaaaca cactcatggc aggattacct acatatcctc tgggaggaac ttccatctgc 12 0 0 
tctggaatta aatatgcatt tcaggtgatt ggagagctac attcccaact cgatggatcc 1260 
gaagtactgc tgctgactga tggggaggat aacactgcaa gttcttgtat tgatgaagtg 132 0 
aaacaaagtg gggccattgt tcattttatt gctttgggaa gagctgctga tgaagcagta 1380 
atagagatga gcaagataac aggaggaagt catttttatg tttcagatga agctcagaac 144 0 
aatggcctca ttgatgcttt tggggctctt acatcaggaa atactgatct ctcccagaag 1500 
tcccttcagc tcgaaagtaa gggattaaca ctgaatagta atgcctggat gaacgacact 1560 
gtcataattg atagtacagt gggaaaggac acgttctttc tcatcacatg gaacagtctg 162 0 
cctcccagta tttctctctg ggatcccagt ggaacaataa tggaaaattt cacagtggat 1680 
gcaacttcca aaatggccta tctcagtatt ccaggaactg caaaggtggg cacttgggca 174 0 
tacaatcttc aagccaaagc gaacccagaa acattaacta ttacagtaac ttctcgagca 1800 
gcaaattctt ctgtgcctcc aatcacagtg aatgctaaaa tgaataagga cgtaaacagt 1860 
ttccccagcc caatgattgt ttacgcagaa attctacaag gatatgtacc tgttcttgga 1920 
gccaatgtga ctgctttcat tgaatcacag aatggacata cagaagtttt ggaacttttg 1980 
gataatggtg caggcgctga ttctttcaag aatgatggag tctactccag gtattttaca 2040 
gcatatacag aaaatggcag atatagctta aaagttcggg ctcatggagg agcaaacact 210 0 
gccaggctaa aattacggcc tccactgaat agagccgcgt acataccagg ctgggtagtg 2160 
aacggggaaa ttgaagcaaa cccgccaaga cctgaaattg atgaggatac tcagaccacc 2220 
ttggaggatt tcagccgaac agcatccgga ggtgcatttg tggtatcaca agtcccaagc 2280 
cttcccttgc ctgaccaata cccaccaagt caaatcacag accttgatgc cacagttcat 2340 
gaggataaga ttattcttac atggacagca ccaggagata attttgatgt tggaaaagtt 2400 
caacgttata tcataagaat aagtgcaagt attcttgatc taagagacag ttttgatgat 2460 
gctcttcaag taaatactac tgatctgtca ccaaaggagg ccaactccaa ggaaagcttt 2520 
gcatttaaac cagaaaatat ctcagaagaa aatgcaaccc acatatttat tgccattaaa 2580 
agtatagata aaagcaattt gacatcaaaa gtatccaaca ttgcacaagt aactttgttt 2640 
atccctcaag caaatcctga tgacattgat cctactccta ctcctactcc tactcctgat 2700 
aaaagtcata attctggagt taatatttct acgctggtat tgtctgtgat tgggtctgtt 2760 
gtaattgtta actttatttt aagtaccacc atttgaacct taacgaagaa aaaaatcttc 2820 
aagtagacct agaagagagt tttaaaaaac aaaacaatgt aagtaaagga tatttctgaa 2880 
tcttaaaatt catcccatgt gtgatcataa actcataaaa ataattttaa gatgtcggaa 2940 
aaggatactt tgattaaata aaaacactca tggatatgta aaaactgtca agattaaaat 3000 
ttaatagttt catttatttg ttattttatt tgtaagaaat agtgatgaac aaagatcctt 3060 



36 



tttcatactg atacctggtt gtatattatt tgatgcaaca gttttctgaa atgatatttc 3120 
aaattgcatc aagaaattaa aatcatctat ctgagtagtc aaaatacaag taaaggagag 3180 
caaataaaca acatttggaa aaaaaaaaaa aaaaaaaa 3218 



<210> 34 
<211> 917 
<212> PRT 

<213> Homo sapiens 



<220> 

<223> chloride channel, calcium activated, family member 
4 (CLCA4) 



<400> 34 
Met Gly Leu Phe 
1 

His Gin Ser Asn 
20 

Asp lie Val lie 
35 

lie Glu Gin lie 
50 

Glu Ala Thr Glu 
65 

Pro Glu Asn Trp 



Asn His Lys His 
100 

Arg Asp Glu Pro 
115 



Arg Gly Phe Val 
5 

Thr Ser Phe lie 



Val lie Asp Pro 
40 

Glu Asp Met Val 
55 

Lys Arg Phe Phe 
70 

Lys Glu Asn Pro 

85 

Ala Asp Val He 



Tyr Thr Lys Gin 
120 



Phe Leu Leu Val 
10 

Lys Leu Asn Asn 
25 

Ser Val Pro Glu 



Thr Thr Ala Ser 
60 

Phe Lys Asn Val 
75 

Gin Tyr Lys Arg 
90 

Val Ala Pro Pro 
105 

Phe Thr Glu Cys 



Leu Cys Leu Leu 
15 

Asn Gly Phe Glu 
30 

Asp Glu Lys He 
45 

Thr Tyr Leu Phe 



Ser He Leu He 
80 

Pro Lys His Glu 
95 

Thr Leu Pro Gly 
110 

Gly Glu Lys Gly 
125 



Glu Tyr He His Phe Thr Pro Asp 
130 135 

Glu Tyr Gly Pro Pro Gly Lys Leu 
145 150 

Arg Trp Gly Val Phe Asp Glu Tyr 

165 



Leu Leu Leu Gly Lys Lys Gin Asn 
140 

Phe Val His Glu Trp Ala His Leu 
155 160 

Asn Glu Asp Gin Pro Phe Tyr Arg 
170 175 



Ala Lys Ser Lys Lys He Glu Ala 
180 

Gly Arg Asn Arg Val Tyr Lys Cys 
195 200 

Ala Cys Arg He Asp Ser Thr Thr 
210 215 

Phe Phe Pro Asp Lys Val Gin Thr 
225 230 



Thr Arg Cys Ser Ala Gly He Ser 
185 190 

Gin Gly Gly Ser Cys Leu Ser Arg 
205 

Lys Leu Tyr Gly Lys Asp Cys Gin 
220 

Glu Lys Ala Ser He Met Phe Met 
235 240 



37 



Gin Ser lie Asp 



Gin Glu Ala Pro 
260 

Trp Glu Val lie 
275 

Val Thr Pro Pro 
290 

Arg lie Val Cys 
305 

Asp Arg Leu Asn 



Thr Val Glu Asn 
340 

Ala Thr lie Val 
355 

Asn Thr Leu Met 
370 

lie Cys Ser Gly 
385 

Ser Gin Leu Asp 



Asn Thr Ala Ser 
420 

Val His Phe lie 
435 

Met Ser Lys lie 
450 

Gin Asn Asn Gly 
465 

Thr Asp Leu Ser 



Leu Asn Ser Asn 
500 

Val Gly Lys Asp 
515 

Ser lie Ser Leu 
530 

Val Asp Ala Thr 
545 



Ser Val Val Glu 
245 

Ser Leu Gin Asn 



Ser Asn Ser Glu 
280 

Pro Pro Pro Val 
295 

Leu Val Leu Asp 
310 

Arg Met Asn Gin 
325 

Gly Ser Trp Val 



Asn Lys Leu lie 
360 

Ala Gly Leu Pro 
375 

lie Lys Tyr Ala 
390 

Gly Ser Glu Val 
405 

Ser Cys lie Asp 



Ala Leu Gly Arg 
440 

Thr Gly Gly Ser 
455 

Leu lie Asp Ala 
470 

Gin Lys Ser Leu 

485 

Ala Trp Met Asn 



Thr Phe Phe Leu 
520 

Trp Asp Pro Ser 
535 

Ser Lys Met Ala 
550 



Phe Cys Asn Glu 
250 

lie Lys Cys Asn 
265 

Asp Phe Lys Asn 



Phe Ser Leu Leu 
300 

Lys Ser Gly Ser 
315 

Ala Ala Lys His 
330 

Gly Met Val His 
345 

Gin lie Lys Ser 



Thr Tyr Pro Leu 
380 

Phe Gin Val lie 
395 

Leu Leu Leu Thr 
410 

Glu Val Lys Gin 
425 

Ala Ala Asp Glu 



His Phe Tyr Val 
460 

Phe Gly Ala Leu 
475 

Gin Leu Glu Ser 
490 

Asp Thr Val lie 
505 

lie Thr Trp Asn 



Gly Thr He Met 
540 

Tyr Leu Ser He 
555 



Lys Thr His Asn 
255 

Phe Arg Ser Thr 
270 

Thr He Pro Met 
285 

Lys lie Arg Gin 



Met Gly Gly Lys 

320 

Phe Leu Leu Gin 
335 

Phe Asp Ser Thr 
350 

Ser Asp Glu Arg 
365 

Gly Gly Thr Ser 



Gly Glu Leu His 
400 

Asp Gly Glu Asp 
415 

Ser Gly Ala He 
430 

Ala Val He Glu 
445 

Ser Asp Glu Ala 



Thr Ser Gly Asn 
480 

Lys Gly Leu Thr 
495 

He Asp Ser Thr 
510 

Ser Leu Pro Pro 
525 

Glu Asn Phe Thr 



Pro Gly Thr Ala 
560 
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Lys Val Gly Thr 



Thr Leu Thr lie 
580 

Pro lie Thr Val 
595 

Ser Pro Met lie 
610 

Leu Gly Ala Asn 
625 

Glu Val Leu Glu 



Asn Asp Gly Val 
660 

Arg Tyr Ser Leu 
675 

Leu Lys Leu Arg 
690 

Val Val Asn Gly 
705 

Glu Asp Thr Gin 



Gly Ala Phe Val 
740 

Tyr Pro Pro Ser 
755 

Lys lie lie Leu 
770 

Lys Val Gin Arg 
785 

Arg Asp Ser Phe 



Pro Lys Glu Ala 
820 

lie Ser Glu Glu 
835 

Asp Lys Ser Asn 
850 

Leu Phe lie Pro 
865 



Trp Ala Tyr Asn 
565 

Thr Val Thr Ser 



Asn Ala Lys Met 
600 

Val Tyr Ala Glu 
615 

Val Thr Ala Phe 
630 

Leu Leu Asp Asn 
645 

Tyr Ser Arg Tyr 



Lys Val Arg Ala 
680 

Pro Pro Leu Asn 
695 

Glu lie Glu Ala 
710 

Thr Thr Leu Glu 

725 

Val Ser Gin Val 



Gin lie Thr Asp 
760 

Thr Trp Thr Ala 
775 

Tyr lie lie Arg 
790 

Asp Asp Ala Leu 
805 

Asn Ser Lys Glu 



Asn Ala Thr His 
840 

Leu Thr Ser Lys 
855 

Gin Ala Asn Pro 
870 



Leu Gin Ala Lys 
570 

Arg Ala Ala Asn 
585 

Asn Lys Asp Val 



lie Leu Gin Gly 
620 

lie Glu Ser Gin 
635 

Gly Ala Gly Ala 
650 

Phe Thr Ala Tyr 
665 

His Gly Gly Ala 



Arg Ala Ala Tyr 
700 

Asn Pro Pro Arg 
715 

Asp Phe Ser Arg 
730 

Pro Ser Leu Pro 
745 

Leu Asp Ala Thr 



Pro Gly Asp Asn 
780 

lie Ser Ala Ser 
795 

Gin Val Asn Thr 
810 

Ser Phe Ala Phe 
825 

lie Phe lie Ala 



Val Ser Asn lie 
860 

Asp Asp lie Asp 
875 



Ala Asn Pro Glu 
575 

Ser Ser Val Pro 
590 

Asn Ser Phe Pro 
605 

Tyr Val Pro Val 



Asn Gly His Thr 
640 

Asp Ser Phe Lys 
655 

Thr Glu Asn Gly 
670 

Asn Thr Ala Arg 
685 

lie Pro Gly Trp 



Pro Glu lie Asp 
720 

Thr Ala Ser Gly 
735 

Leu Pro Asp Gin 
750 

j 

Val His Glu Asp j 
765 I 

I 

Phe Asp Val Gly 



lie Leu Asp Leu 
800 

Thr Asp Leu Ser 
815 

Lys Pro Glu Asn 
830 

lie Lys Ser lie 
845 

Ala Gin Val Thr 



Pro Thr Pro Thr 
880 
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Pro Thr Pro Thr 



Thr Leu Val Leu 
900 

Leu Ser Thr Thr 
915 



Pro Asp Lys Ser 
885 

Ser Val lie Gly 
He 



His Asn Ser Gly 
890 

Ser Val Val He 
905 



Val Asn He Ser 
895 

Val Asn Phe He 
910 



<210> 35 

<211> 5749 

<212> DNA 

<213> Homo sapiens 
<220> 

<223> nuclear receptor subfamily 3, group C, member 2 
(NR3C2) 



<400> 35 

cgcgggagcc 

gagcgttgct 

ggttggattt 

gtagacggcg 

ctccctgaag 

tcttccctgg 

agctgtgttt 

gaactactcc 

aaaactgagc 

atggattctg 

agtccagcta 

catcgtcctt 

gggagctccg 

gagaaaagcc 

ggaatcaact 

atcacccagg 

cacagccctg 

aaatcctcaa 

aataatgtca 

tctgtttcca 

actgtgggat 

tctgctggat 

gctcaagagg 

actggccagc 

tcatgtctag 

caagaatcaa 

ccgtttccat 

ctatcaggaa 

ttcccagtgg 

ccttcctctg 

gctcaaggta 

tcctttcctc 

agaagaagtg 

ttacgaagtg 

gaggcttcag 

agagcagtgg 

aagattcgac 

aatttaggag 

ccacagcagc 

acgtacatcg 

acaatctcac 



aacttcaggc 
gcctcgggct 
aggggggctc 
agagacggac 
gtctagatat 
gacctacaga 
ccggtgctat 
cttgccttca 
tggaatctaa 
taagagatgc 
agatttatca 
ccactctaag 
tgaatggtgg 
cgtctgtttg 
ctgtgtcctc 
gaactcctct 
cacatgctag 
tttccagccc 
ctctgagatc 
gcccttcgaa 
ctatctgtag 
ccagtacatt 
tcccttttcc 
ttaatattgt 
gaggaaatag 
ccaagcattc 
ttatggatgg 
ttttaggacc 
gtattaaaca 
ctattgttgg 
caatatcttt 
ctgtcaatac 
atgggtatcc 
tttctactgg 
gatgccatta 
aagggcaaca 
gaaagaattg 
cacgaaagtc 
agcagccccc 
ctcctgcaaa 
gagcgctcac 



tgctcagagg 
ctcccgctgc 
cgcagcaggg 
cccggccgag 
ggaaagacgg 
gaggaccgat 
tccaaacaac 
gcaagacaat 
ggaactttca 
tgactattcc 
gaatgttgaa 
ttgtgtgaac 
cgtcatgcgc 
cagccctctg 
caccacagcc 
gacatgctcc 
caatgtgggc 
tccaagtcac 
ctctgtgtct 
cactaataac 
ccctgtaaac 
gcgggatgtg 
taagactgag 
ccagtacata 
caaaataaat 
atgttcaggc 
ctcgtatttt 
acctgtgccc 
agaaccagat 
ggtgaattca 
atcacgatcg 
tttagtggag 
ggtcttagaa 
atcttcaaga 
tggggtagtc 
caactattta 
tcctgcttgc 
aaagaagttg 
acccccaccc 
agaaccctcg 
accttccccc 



aagcccgtgc 
agggagagcg 
gtttcgtggc 
gcagggatgg 
tggggtcaag 
gagaataact 
agtactcaag 
aatcggcctg 
gcaactgtag 
tatgagcagc 
cagctggtga 
acgcccttga 
gccattgtta 
aacatgacat 
agctttggca 
cctaatgctg 
tctcctctct 
tgcagtgtaa 
agccctgcaa 
agatccacgc 
aatgccttca 
gttcccagtc 
gaagtagaga 
aaaccagaac 
tcggattctt 
acctctttta 
tcctttatgg 
ggctttgatg 
gacgggagct 
ggtggacagt 
gctagagacc 
tcatggaaat 
tacat tccag 
ccttcaaaaa 
acctgtggca 
tgtgctggaa 
agacttcaga 
ggaaagttaa 
ccacccccgc 
gtcaacacag 
gttatggtcc 



agtcagtcac 
gcactcgctg 
ggtggcaagc 
agaccaaagg 
tttctcaggc 
acatggagat 
gaagcagcaa 
ggattttaac 
ctgagtccat 
agaaccaaca 
aattttacaa 
gatcatttat 
aaagccctat 
cttcggtttg 
gttttccagt 
aaaatcgagg 
caagtccgtt 
aatctccagt 
atattaacaa 
tttccagtcc 
gctacactgc 
cagacacgca 
gtgccatctc 
cagatggagc 
cattctcagt 
aagggaatcc 
atgataaaga 
gtaactgtga 
attacccaga 
ccttccacta 
aatctttcca 
cacacggcga 
aaaatgtatc 
tatgtttggt 
gctgcaaagt 
gaaatgattg 
aatgtcttca 
aagggattca 
aaagcccaga 
cactggttcc 
ttgaaaacat 



ctgggtgcaa 
gcctggatgt 
gctgcaacag 
ctaccacagt 
tgtggagcgt 
tgtcaacgta 
agaaaaacaa 
atctgatatt 
gggtttatat 
aggaagcatg 
aggaaatggc 
gtctgactct 
catgtgtcat 
cagccctgct 
gcacagccca 
ctccaggtcg 
aagtagcatg 
ctccagtccc 
ctcaaggtgc 
ggcagccagt 
ttctggcacc 
ggagaaaggt 
aaatggtgtg 
ttttagcagc 
accaataaag 
aacagtaaac 
ctattattcc 
aggcagcgga 
ggccagcatc 
caggattggt 
acacctgagt 
cctgtcgtct 
aagctctact 
gtgtggggat 
tttcttcaaa 
catcattgat 
agctggaatg 
cgaggagcag 
ggaagggaca 
tcagctctcc 
tgaacctgaa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 
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attgtatatg caggctatga cagctcaaaa ccagatacag ccgaaaatct gctctccacg 2520 
ctcaaccgct tagcaggcaa acagatgatc caagtcgtga agtgggcaaa ggtacttcca 2580 
ggatttaaaa acttgcctct tgaggaccaa attaccctaa tccagtattc ttggatgtgt 2640 
ctatcatcat ttgccttgag ctggagatcg tacaaacata cgaacagcca atttctctat 2700 
tttgcaccag acctagtctt taatgaagag aagatgcatc agtctgccat gtatgaacta 2760 
tgccagggga tgcaccaaat cagccttcag ttcgttcgac tgcagctcac ctttgaagaa 2820 
tacaccatca tgaaagtttt gctgctacta agcacaattc caaaggatgg cctcaaaagc 2 880 
caggctgcat ttgaagaaat gaggacaaat tacatcaaag aactgaggaa gatggtaact 2 94 0 
aagtgtccca acaattctgg gcagagctgg cagaggttct accaactgac caagctgctg 3 000 
gactccatgc atgacctggt gagcgacctg ctggaattct gcttctacac cttccgagag 3060 
tcccatgcgc tgaaggtaga gttccccgca atgctggtgg agatcatcag cgaccagctg 3120 
cccaaggtgg agtcggggaa cgccaagccg ctctacttcc accggaagtg actgcccgct 3180 
gcccagaaga actttgcctt aagtttccct gtgttgttcc acacccagaa ggacccaaga 3240 
aaacctgttt ttaacatgtg atggttgatt cacacttgtt caacagtttc tcaagtttaa 3300 
agtcatgtca gaggtttgga gccgggaaag ctgtttttcc gtggatttgg cgagaccaga 3 3 60 
gcagtctgaa ggattcccca cctccaatcc cccagcgctt agaaacatgt tcctgttcct 3420 
cgggatgaaa agccatatct agtcaataac tctgattttg atattttcac agatggaaga 3480 
agttttaact atgccgtgta gtttctggta tcgttcgctt gttttaaaag ggttcaagga 3540 
ctaacgaacg ttttaaagct tacccttggt ttgcacataa aacgtatagt caatatgggg 3 600 
cattaatatt cttttgttat taaaaaaaca caaaaaaata ataaaaaaat atatacagat 3660 
tcctgttgtg taataacaga actcgtggcg tggggcagca gctgcctctg agccctcgct 3720 
cgtccacggt cttctgcatc actggtatac acactcgtta gcgtccattt cttatttaat 3780 
tagaatggat aagatgatgt taaatgcctt ggtttgattt ctagtatcta ttgtgttggc 3840 
tttacaaata attttttgca gtcttttgct gtgctgtaca ttactgtatg tataaattat 3900 
gaaggacctg aaataaggta taaggatctt ttgtaaatga gacacataca aaaaaaatct 3 960 
ttaatggtta ataggatgaa tgggaaagta tttttgaaag aattctattt tgctggagac 4020 
tatttaagta ctatctttgt ctaaacaagg taattttttt ttgtaaagtg caatgtcctg 4080 
catgcataat gaaccgttta cagtgtattt aagaaaggga aagctgtgcc ttttttagct 414 0 
tcatatctaa tttaccatta ttttacagtc tctgttgtaa ataaccacac tgaaacctct 4200 
tcggttgtct tgaaaccttt ctactttttc tgtacttttt gttttgttct tggtctcccg 4260 
cttggggcat ttgtgggact ccagcacgtt ttctggcttc tgcttcatcc tgctccatcg 4320 
gggaatgaca cactgcggtg tctgcagctc ctggaaggtg tcatttgaca acacatgtgg 4380 
gagaggaggt ccttggagtg ctgcagcttt gggaaagcct gcctcgtttc ccttttcctc 4440 
tagaagcaga accagctcta cgagagtgag actgggaact tgatggctca gagagcatct 4500 
tttcctccca ttttagaaaa tcagattttc tcctgtggga aaaaaaaatt ccatgcactc 4560 
tctctctgtt aaagatcagc tattcccttc tgatcttgga aagaggttct gcactcctgg 4620 
aaccggtcac aggaacgcac agatcatggc aggatgcgct gggacggccc atcttggcaa 4680 
ggttcagtct gaatggcatg gagaccggga gatagagggg ttttagattt ttaaaaggta 4740 
ggttttaaaa ataagtttta tacataaaca gttttggaga aaaattacag atcatataag 4800 
caagacagtg gcactaaaat gtttaattca ttaatctgtt tgtttggcac tgatgcaatg 4860 
tatggctttt ctcttgcccc aaatcacaaa catatgtatc tttggggaaa ctaacaatat 4920 
gattgcacta aataaactac tttgaataga ggccaaatta atcttttaaa aatgatgata 4980 
atcatcaggt ttactcagtg aaatcatatt aattattttc caaaatctaa aagctgtagc 5040 
tggagaagcc catggccacg aggaagcagc aattaattag atcaacactt ttctccaggg 5100 
ttcaccatgc aggcaacatt accttgtctt tcaaaagaca cctgccttag tgcaagggga 5160 
aacctgtgaa agctgcactc agagggagga gtctttctta cataatttgc aatttcagga 5220 
atttaattta taggcagatc tttaaataca gtcaacttac ggtgcacagt aatatgaaag 5280 
ccacactttg aaggtaataa atacacagca tgcagactgg gagttgctag caaacaaatg 5340 
gcttacttac aaaagcagct tttagttcag acttagtttt tataaaatga gaattctgac 5400 
ttacttaacc aggtttggga tggagatggt ctgcatcagc tttttgtatt aacaaagtta 5460 
ctggctcttt gtgtgtctcc aggtaacttt gcttgattaa acagcaaagc catattctaa 5520 
attcactgtt gaatgcctgt cccagtccaa attgtctgtc tgctcttatt tttgtaccat 5580 
attgctctta aaaatcttgg tttggtacag ttcataattc accaaaaagt tcatataatt 564 0 
taaagaaaca ctaaattagt ttaaaatgaa gcaatttata tctttatgca aaaacatatg 5700 
tctgtctttg caaaggactg taagcagatt acaataaatc ctttacttt 574 9 



<210> 36 
<211> 984 
<212> PRT 

<213> Homo sapiens 
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<220> 

<223> nuclear receptor subfamily 3, group C, member 2 
(NR3C2) 

Met°Glu^Thr Lys Gly Tyr His Ser Leu Pro Glu Gly Leu Asp Met Glu 
15 10 15 

Arg Arg Trp Gly Gin Val Ser Gin Ala Val Glu Arg Ser Ser Leu Gly 
20 25 30 

Pro Thr Glu Arg Thr Asp Glu Asn Asn Tyr Met Glu He Val Asn Val 
35 40 45 

Ser Cys Val Ser Gly Ala lie Pro Asn Asn Ser Thr Gin Gly Ser Ser 
50 55 60 

Lvs Glu Lys Gin Glu Leu Leu Pro Cys Leu Gin Gin Asp Asn Asn Arg 
65 70 75 80 

Pro Gly He Leu Thr Ser Asp He Lys Thr Glu Leu Glu Ser Lys Glu 
85 90 95 

Leu Ser Ala Thr Val Ala Glu Ser Met Gly Leu Tyr Met Asp Ser Val 
100 105 110 

Arg Asp Ala Asp Tyr Ser Tyr Glu Gin Gin Asn Gin Gin Gly Ser Met 
115 120 125 

Ser Pro Ala Lys He Tyr Gin Asn Val Glu Gin Leu Val Lys Phe Tyr 
130 135 140 



Lys Gly Asn Gly His Arg Pro Ser Thr Leu Ser Cys Val Asn Thr Pro 
145 150 155 160 

Leu Arg Ser Phe Met Ser Asp Ser Gly Ser Ser Val Asn Gly Gly Val 
165 170 175 



Met Arg Ala He Val Lys Ser Pro He Met Cys His Glu Lys Ser Pro 
180 185 190 

Ser val Cys Ser Pro Leu Asn Met Thr Ser Ser Val Cys Ser Pro Ala 
195 200 205 

Gly He Asn Ser Val Ser Ser Thr Thr Ala Ser Phe Gly Ser Phe Pro 
210 215 220 

val His Ser Pro He Thr Gin Gly Thr Pro Leu Thr Cys Ser Pro Asn 
225 230 235 240 

Ala Glu Asn Arg Gly Ser Arg Ser His Ser Pro Ala His Ala Ser Asn 
245 250 255 

Val Gly Ser Pro Leu Ser Ser Pro Leu Ser Ser Met Lys Ser Ser He 
260 265 270 

Ser Ser Pro Pro Ser His Cys Ser Val Lys Ser Pro Val Ser Ser Pro 
275 280 285 

Asn Asn val Thr Leu Arg Ser Ser Val Ser Ser Pro Ala Asn He Asn 
290 295 300 
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Asn Ser Arg Cys 
305 

Thr Leu Ser Ser 



Val Asn Asn Ala 
340 

Ser Thr Leu Arg 
355 

Ala Gin Glu Val 
370 

Ser Asn Gly Val 
385 

Glu Pro Asp Gly 



He Asn Ser Asp 
420 

Lys His Ser Cys 
435 

Pro Phe Pro Phe 
450 

Asp Tyr Tyr Ser 
465 

Asp Gly Asn Cys 



Pro Asp Asp Gly 
500 

He Val Gly Val 
515 

Ala Gin Gly Thr 
530 

Gin His Leu Ser 
545 

Lys Ser His Gly 



Leu Glu Tyr He 
580 

Ser Thr Gly Ser 
595 

Glu Ala Ser Gly 
610 



Ser Val Ser Ser 
310 

Pro Ala Ala Ser 
325 

Phe Ser Tyr Thr 



Asp Val Val Pro 
360 

Pro Phe Pro Lys 
375 

Thr Gly Gin Leu 
390 

Ala Phe Ser Ser 
405 

Ser Ser Phe Ser 



Ser Gly Thr Ser 
440 

Met Asp Gly Ser 
455 

Leu Ser Gly He 
470 

Glu Gly Ser Gly 
485 

Ser Tyr Tyr Pro 



Asn Ser Gly Gly 
520 

He Ser' Leu Ser 
535 

Ser Phe Pro Pro 
550 

Asp Leu Ser Ser 

565 

Pro Glu Asn Val 



Ser Arg Pro Ser 
600 

Cys His Tyr Gly 
615 



Pro Ser Asn Thr 

315 

Thr Val Gly Ser 
330 

Ala Ser Gly Thr 
345 

Ser Pro Asp Thr 



Thr Glu Glu Val 
380 

Asn He Val Gin 

395 

Ser Cys Leu Gly 
410 

Val Pro He Lys 
425 

Phe Lys Gly Asn 



Tyr Phe Ser Phe 
460 

Leu Gly Pro Pro 
475 

Phe Pro Val Gly 
490 

Glu Ala Ser He 
505 

Gin Ser Phe His 



Arg Ser Ala Arg 
540 

Val Asn Thr Leu 
555 

Arg Arg Ser Asp 
570 

Ser Ser Ser Thr 
585 

Lys He Cys Leu 



Val Val Thr Cys 
620 



Asn Asn Arg Ser 
320 

He Cys Ser Pro 
335 

Ser Ala Gly Ser 
350 

Gin Glu Lys Gly 
365 

Glu Ser Ala He 



Tyr He Lys Pro 
400 

Gly Asn Ser Lys 
415 

Gin Glu Ser Thr 
430 

Pro Thr Val Asn 
445 

Met Asp Asp Lys 



Val Pro Gly Phe 
480 

He Lys Gin Glu 
495 

Pro Ser Ser Ala 
510 

Tyr Arg He Gly 
525 

Asp Gin Ser Phe 



Val Glu Ser Trp 
560 

Gly Tyr Pro Val 
575 

Leu Arg Ser Val 
590 

Val Cys Gly Asp 
605 

Gly Ser Cys Lys 



Val Phe Phe Lys Arg Ala Val Glu Gly Gin His Asn Tyr Leu Cys Ala 
625 630 635 640 



Gly Arg Asn Asp Cys lie lie Asp Lys lie Arg Arg Lys Asn Cys Pro 
645 650 655 

Ala Cys Arg Leu Gin Lys Cys Leu Gin Ala Gly Met Asn Leu Gly Ala 
660 665 670 

Arg Lys Ser Lys Lys Leu Gly Lys Leu Lys Gly lie His Glu Glu Gin 
675 680 685 

Pro Gin Gin Gin Gin Pro Pro Pro Pro Pro Pro Pro Pro Gin Ser Pro 
690 695 700 

Glu Glu Gly Thr Thr Tyr lie Ala Pro Ala Lys Glu Pro Ser Val Asn 
705 710 715 720 

Thr Ala Leu Val Pro Gin Leu Ser Thr lie Ser Arg Ala Leu Thr Pro 
725 730 735 

Ser Pro Val Met Val Leu Glu Asn lie Glu Pro Glu lie Val Tyr Ala 
740 745 750 

Gly Tyr Asp Ser Ser Lys Pro Asp Thr Ala Glu Asn Leu Leu Ser Thr 
755 760 765 

Leu Asn Arg Leu Ala Gly Lys Gin Met He Gin Val Val Lys Trp Ala 
770 775 780 

Lys Val Leu Pro Gly Phe Lys Asn Leu Pro Leu Glu Asp Gin He Thr 
785 790 795 800 

Leu He Gin Tyr Ser Trp Met Cys Leu Ser Ser Phe Ala Leu Ser Trp 
805 810 815 

Arg Ser Tyr Lys His Thr Asn Ser Gin Phe Leu Tyr Phe Ala Pro Asp 
820 825 830 

Leu Val Phe Asn Glu Glu Lys Met His Gin Ser Ala Met Tyr Glu Leu 
835 840 845 

Cys Gin Gly Met His Gin He Ser Leu Gin Phe Val Arg Leu Gin Leu 
850 855 860 

Thr Phe Glu Glu Tyr Thr He Met Lys Val Leu Leu Leu Leu Ser Thr 
865 870 875 880 

He Pro Lys Asp Gly Leu Lys Ser Gin Ala Ala Phe Glu Glu Met Arg 
885 890 895 

Thr Asn Tyr He Lys Glu Leu Arg Lys Met Val Thr Lys Cys Pro Asn 
900 905 910 

Asn Ser Gly Gin Ser Trp Gin Arg Phe Tyr Gin. Leu Thr Lys Leu Leu 
915 920 925 

Asp Ser Met His Asp Leu Val Ser Asp Leu Leu Glu Phe Cys Phe Tyr 
930 935 940 
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Thr Phe Arg Glu SerHis Ala Leu Lys Val Glu Phe Pro Ala Met Leu 
945 950 955 960 

Val Glu lie lie Ser Asp Gin Leu Pro Lys Val Glu Ser Gly Asn Ala 
965 970 975 

Lys Pro Leu Tyr Phe His Arg Lys 
980 



<210> 37 

<211> 782 

<212> DNA 

<213> Homo sapiens 



<400> 37 

cctaccaggt 

tgccactaca 

ctgatcttga 

ttataggagc 

aaaaattgat 

tgagacccaa 

ctatttggaa 

ttacttttca 

cctaacaact 

gcaataaata 

attcgatggc 

tgcagagctt 

tcacaacaca 

tc 



tcaagcaact 
cctggctttt 
attcctggcc 
caccacacct 
tactctgttg 
gtatctccta 
caaatctcaa 
ttcttaatag 
actgttaagt 
cagatgggac 
atctcagctt 
cattattttc 
tgcagcaata 



ctgctgcctc 
tgtattttta 
tgaagtaatc 
ggcataactg 
tttcttcctt 
cctagaaaaa 
ttaattaaca 
ctctttccat 
gattaatgaa 
tacataaatt 
tctcatagag 
tttcttctag 
atgggctcta 



agctcccaag 
gtagagatgg 
tgcctgcctc 
gtatttttta 
tttttttttt 
aaacacacta 
tatacttcaa 
aaaaatgttc 
acaggagtga 
gtggaggtcc 
ctgtttcact 
caatcagtcc 
ttggtacacc 



tagctgggat 
ttttcactat 
agcctcccaa 
tatgcttcct 
ttttggcttt 
aacagtaaat 
ggagaagact 
cacaagtgta 
caggagtgaa 
tgatgcaaaa 
gtgagggtct 
aaagcacaat 
cacagtttta 



tacaggtgca 
gttggccagg 
agtgctggga 
gggcaactta 
gaccaatttg 
gattaccaac 
taacaaaatc 
tcaaattagt 
tttaataata 
ctctctctgt 
ttatccttca 
gtcagaaaga 
tctttaacaa 



60 

120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
782 



<210> 38 

<211> 995 

<212> DNA 

<213> Homo sapiens 

<220> 

<223> tumor necrosis factor receptor superfamily, member 
17 (TNFRSF17) 



<400> 38 

aagactcaaa 

acacagacag 

agctgctctt 

tgttctttct 

gctcccaaaa 

gttcttctaa 

cagtgaaagg 

tggcagtttt 

agtttaaaaa 

ggactggtga 

gtgaagactg 

ctatggagga 

tgccagctgc 

tttcgactcg 

tctttaggat 

actctttatg 

cttggtttca 



cttagaaact 
cccccgtaag 
gctgcatttg 
gtagctccct 
tgaatatttt 
tactcctcct 
aacgaatgcg 
cgtgctaatg 
cacaggatca 
tgaaattatt 
catcaagagc 
aggcgcaacc 
tttgagtgct 
agcagtgcca 
gactgtattt 
ttagatatat 
tgattaaagt 



tgaattagat 
aacccacgaa 
ctctggaatt 
tgttttcttt 
gacagtttgt 
ctaacatgtc 
attctctgga 
tttttgctaa 
ggtctcctgg 
cttccgagag 
aaaccgaagg 
attcttgtca 
acggagatag 
ctttaaaaat 
ttcagttgcc 
ttctctaggt 
cttttttttt 



gtggtattca 
gcaggcgaag 
cttgtagaga 
ttgtgatcat 
tgcatgcttg 
agcgttattg 
cctgtttggg 
ggaagataag 
gcatggctaa 
gcctcgagta 
tcgactctga 
ccacgaaaac 
agaaatcaat 
cttttgtcag 
gatacagctt 
tactgttggg 
cctga 



aatccttacg 
ttcattgttc 
tattacttgt 
gttgcagatg 
cataccttgt 
taatgcaagt 
actgagctta 
ctctgaacca 
cattgacctg 
cacggtggaa 
ccattgcttt 
gaatgactat 
ttctgctagg 
aatagatgat 
tttgtcctct 
agcttaatgg 



tgccgcgaag 
tcaacattct 
ccttccaggc 
gctgggcagt 
caacttcgat 
gtgaccaatt 
ataatttctt 
ttaaaggacg 
gaaaagagca 
gaatgcacct 
ccactcccag 
tgcaagagcc 
taattaacca 
gtgtcagatc 
aactgtggaa 
tagaaacttc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

995 
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<210> 39 
<211> 184 

<212> PRT 

<213> Homo sapiens 
<220> 

<223> tumor necrosis factor receptor superfamily, member 
17 (TNFRSF17) 

<400> 39 

Met Leu Gin Met Ala Gly Gin Cys Ser Gin Asn Glu Tyr Phe Asp Ser 
15 10 15 

Leu Leu His Ala Cys lie Pro Cys Gin Leu Arg Cys Ser Ser Asn Thr 
20 25 30 

Pro Pro Leu Thr Cys Gin Arg Tyr Cys Asn Ala Ser Val Thr Asn Ser 
35 40 45 

Val Lys Gly Thr Asn Ala lie Leu Trp Thr Cys Leu Gly Leu Ser Leu 
50 55 60 

lie lie Ser Leu Ala Val Phe Val Leu Met Phe Leu Leu Arg Lys lie 
65 70 75 80 



Ser Ser Glu Pro Leu Lys Asp Glu 
85 

Leu Gly Met Ala Asn lie Asp Leu 
100 

lie lie Leu Pro Arg Gly Leu Glu Tyr Thr Val Glu Glu Cys Thr Cys 
115 120 125 

Glu Asp Cys lie Lys Ser Lys Pro Lys Val Asp Ser Asp His Cys Phe 
130 135 140 

Pro Leu Pro Ala Met Glu Glu Gly Ala Thr lie Leu Val Thr Thr Lys 
145 150 155 160 



Thr Asn Asp Tyr 

lie Glu Lys Ser 
180 



Cys Lys Ser Leu 

165 

lie Ser Ala Arg 



Phe Lys Asn Thr 
90 

Glu Lys Ser Arg 
105 



Gly Ser Gly Leu 
95 

Thr Gly Asp Glu 
110 



Pro Ala Ala Leu Ser Ala Thr Glu 
170 175 



<210> 40 

<211> 2486 

<212> DNA 

<213> Homo sapiens 

<220> 

<223> hypothetical protein FLJ21511 



tcacttggca acgggacgcg 60 
gcgggaacct gggggcgcag 12 0 
agcgctgccc ctcgccgcgg 180 
gctgggatgt gtttcttggt 24 0 
tttgcaaaca ctagaactca 300 



<400> 40 

acactgcctc 

ggaacgaggg 

ggctagggca 

cgatgacctc 

ctctctacca 



ggttcggcaa 
gcgcggacgc 
gcgggcccga 
gctgtggaga 
tgacctggga 



gtgggtcagt 
aggcccggga 
cccgcacggc 
gaaatcctct 
ccgatgatct 



tggctggggc 
ggacgcggcg 
tttcctggaa 
tggagtcgct 
attactttcc 
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ctgggcttga 
tctggaaatt 
gcatagcctc 
tgtcttcctc 
acctcagaat 
atacttcact 
gtgccatagc 
agaagactgg 
ctgcttttgg 
ccagatgggc 
cagtactgct 
ctggtttgat 
catgggcagc 
aaacacttgg 
tgatatttta 
caggaggtgt 
tcgggctgaa 
acagttctaa 
ttgttggtgt 
gcaaagtggc 
acaatgaagg 
atttcataac 
ccatgtggct 
acacttgggg 
ttccgtcacc 
agctggtgga 
tgcaggctat 
gatatatcac 
tgaaggatat 
tgatcaggtt 
tggcaaaatt 
tagaccacag 
aaggacacaa 
gaaacattta 
aaaaagaaga 
catggtatct 
atttctctaa 



aggttttagt 
ggttaacaag 
cttccaggct 
actgatagtg 
ttggggattc 
aaacccaatc 
cacacttgat 
tgaggtagcc 
tagccttgtg 
agtgagtggg 
gtgcttggca 
ctggtgggtt 
tgctgtgtct 
acaccttatt 
tcttctagaa 
ctacgctaga 
tatgctattt 
agtgcttttc 
gggattgttg 
accaaccaaa 
gtggtctagt 
aattttggag 
aggggaaaag 
gattatggct 
agagggcgag 
ttttgtcgtg 
tgctgtttca 
ttcagcacct 
cgacagcact 
gggttatgca 
taggatccct 
agaagtttct 
ttatgaaaac 
aaacaagaag 
gattaatgaa 
atgcagtggg 
aaaaaaaaaa 



atagcatttc 
aagtggatgc 
ccaaatgcca 
caagctgtga 
attttaggac 
tggagttatc 
cgtattggca 
acggggatgg 
ttcctcaccc 
catccacatc 
agtggattga 
acaggaacag 
ggctgtgtct 
aactcaggga 
atatttttct 
gaaagatcag 
ggtcctaaga 
agaaagagtg 
ggattaggac 
gaggtctctg 
ctagaaagat 
agtgatgctt 
ttgggtttct 
ttgtcaagat 
atcgcaccag 
acacactttg 
aaactactga 
ggctccagag 
gatcatgaca 
agaatctccc 
gatgacccca 
gagaaaattc 
aaccataatt 
ttattggctg 
agtgggaaaa 
aaattacctc 
aaaaaa 



tttctccaat 
taaccctgct 
aacttcgact 
cttggtggtc 
agattgttct 
agatgtccaa 
cagatggtga 
cctctagacc 
actgggtttt 
cagggccaga 
tgcttccatc 
cttcagctgc 
tcgccatctt 
caaaccctgg 
gtgcctggtg 
atgtgctttt 
aaaaccttga 
aaaaatacat 
tacggcataa 
ctgccatctg 
cagctcacct 
ctaagcccta 
atacagactt 
acccaattgt 
ccatcacatt 
ggaaccacga 
aaagtagctc 
attatctaca 
gatggtgtga 
atgctgaact 
ctaattatag 
attttaatcc 
ttcatatgaa 
ggaaaatcta 
tacacatgaa 
catttgtaaa 



attcctaaca 
gaggataatc 
gatggttctt 
aggaagtcat 
tgttgttcta 
caaagtgata 
ctgcagtaaa 
caactggctg 
tggagaagtc 
tcctaaccca 
ttgtttgtgg 
ggggctcctt 
tactgcatcc 
gaaaaccatg 
cacagctttt 
ggggacaatg 
tttgcttctt 
gaaacttttt 
agcctatgag 
gcctttcagg 
gctcaatgaa 
tatggggaac 
tggtccaagc 
gaaatctgag 
gaccgttaac 
agatgacctc 
taatcaagtg 
gctcactgaa 
atacattatg 
gagtgattca 
agacaaccag 
cagatttgga 
tactcccaaa 
agaaaaaaag 
gaacctcaac 
ctatgttgct 



attactcctt 
actattggca 
gcgcttgggg 
ttgcaaaggt 
cgcatatggt 
ctgacattaa 
cctgaagaaa 
ctggcagggg 
tctcttgttt 
tttggaggtg 
tttcgtggta 
tacctgcaca 
atgtggcccc 
accattgcca 
aagtttgtcc 
atgttaatta 
caaacaaaaa 
ctgtggctgc 
agaaaactgg 
tttggatatg 
acaggtgcag 
aatgacttaa 
acaaggtatc 
catcaccttc 
atttcgggca 
gacaggaaac 
atatttctgg 
catggcaatg 
tatcgagggc 
gaaattcaga 
aaagtggtca 
tcctacaaag 
tactttttat 
tatgtaagat 
ttaaaaaaca 
taataaaaac 



360 
420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2486 



<210> 41 
<211> 699 
<212> PRT 

<213> Homo sapiens 
<220> 

<223> hypothetical protein FLJ21511 

<400> 41 

Met Thr Ser Leu Trp Arg Glu lie Leu Leu Glu Ser Leu Leu Gly Cys 
15 10 15 

Val Ser Trp Ser Leu Tyr His Asp Leu Gly Pro Met lie Tyr Tyr Phe 
20 25 30 

Pro Leu Gin Thr Leu Glu Leu Thr Gly Leu Glu Gly Phe Ser lie Ala 
35 40 45 

Phe Leu Ser Pro lie Phe Leu Thr lie Thr Pro Phe Trp Lys Leu Val 
50 55 60 
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Asn Lys Lys Trp 
65 

lie Ala Ser Phe 



Ala Leu Gly Val 
100 

Ser Gly Ser His 
115 

Gly Gin lie Val 

130 

Pro lie Trp Ser 
145 

Ala lie Ala Thr 



Pro Glu Glu Lys 
180 

Pro Asn Trp Leu 
195 

Thr His Trp Val 

210 

Ser Gly His Pro 
225 

Val Leu Leu Cys 



Phe Arg Gly Thr 
260 

Ala Gly Leu Leu 
275 

Val Phe Ala lie 
290 

Leu lie Asn Ser 
305 

lie Phe Tyr Leu 



Lys Phe Val Pro 
340 

Leu Gly Thr Met 
355 

Lys Lys Asn Leu 
370 



Met Leu Thr Leu 
70 

Gin Ala Pro Asn 
85 

Ser Ser Ser Leu 



Leu Gin Arg Tyr 
120 

Leu Val Val Leu 
135 

Tyr Gin Met Ser 
150 

Leu Asp Arg lie 
165 

Lys Thr Gly Glu 



Leu Ala Gly Ala 
200 

Phe Gly Glu Val 
215 

His Pro Gly Pro 
230 

Leu Ala Ser Gly 
245 

Gly Leu lie Trp 



Tyr Leu His Thr 
280 

Phe Thr Ala Ser 
295 

Gly Thr Asn Pro 
310 

Leu Glu lie Phe 
325 

Gly Gly Val Tyr 



Met Leu lie lie 
360 

Asp Leu Leu Leu 
375 



Leu Arg lie lie 
75 

Ala Lys Leu Arg 
90 

He Val Gin Ala 
105 

Leu Arg He Trp 



Arg He Trp Tyr 
140 

Asn Lys Val He 
155 

Gly Thr Asp Gly 
170 

Val Ala Thr Gly 
185 

Ala Phe Gly Ser 



Ser Leu Val Ser 
220 

Asp Pro Asn Pro 
235 

Leu Met Leu Pro 
250 

Trp Val Thr Gly 
265 

Trp Ala Ala Ala 



Met Trp Pro Gin 
300 

Gly Lys Thr Met 
315 

Phe Cys Ala Trp 
330 

Ala Arg Glu Arg 
345 

Gly Leu Asn Met 



Gin Thr Lys Asn 
380 



Thr He Gly Ser 
80 

Leu Met Val Leu 
95 

Val Thr Trp Trp 
110 

Gly Phe He Leu 
125 

Thr Ser Leu Asn 



Leu Thr Leu Ser 
160 

Asp Cys Ser Lys 
175 

Met Ala Ser Arg 
190 

Leu Val Phe Leu 
205 

Arg Trp Ala Val 



Phe Gly Gly Ala 
240 

Ser Cys Leu Trp 
255 

Thr Ala Ser Ala 
270 

Val Ser Gly Cys 

285 

Thr Leu Gly His 



Thr He Ala Met 
320 

Cys Thr Ala Phe 
335 

Ser Asp Val Leu 
350 

Leu Phe Gly Pro 

365 

Ser Ser Lys Val 
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Leu Phe Arg Lys Ser Glu Lys Tyr Met Lys Leu Phe Leu Trp Leu Leu 
385 390 395 400 

Val Gly Val Gly Leu Leu Gly Leu Gly Leu Arg His Lys Ala Tyr Glu 
405 410 415 

Arg Lys Leu Gly Lys Val Ala Pro Thr Lys Glu Val Ser Ala Ala He 
420 425 430 

Trp Pro Phe Arg Phe Gly Tyr Asp Asn Glu Gly Trp Ser Ser Leu Glu 
435 440 445 

Arg Ser Ala His Leu Leu Asn Glu Thr Gly Ala Asp Phe He Thr He 
450 455 460 

Leu Glu Ser Asp Ala Ser Lys Pro Tyr Met Gly Asn Asn Asp Leu Thr 
465 470 475 480 

Met Trp Leu Gly Glu Lys Leu Gly Phe Tyr Thr Asp Phe Gly Pro Ser 
485 490 495 

Thr Arg Tyr His Thr Trp Gly He Met Ala Leu Ser Arg Tyr Pro He 
500 505 510 

Val Lys Ser Glu His His Leu Leu Pro Ser Pro Glu Gly Glu He Ala 
515 520 525 

Pro Ala He Thr Leu Thr Val Asn He Ser Gly Lys Leu Val Asp Phe 
530 535 540 

Val Val Thr His Phe Gly Asn His Glu Asp Asp Leu Asp Arg Lys Leu 
545 550 555 560 

Gin Ala He Ala Val Ser Lys Leu Leu Lys Ser Ser Ser Asn Gin Val 
565 570 575 

He Phe Leu Gly Tyr He Thr Ser Ala Pro Gly Ser Arg Asp Tyr Leu 
580 585 590 

Gin Leu Thr Glu His Gly Asn Val Lys Asp He Asp Ser Thr Asp His 
595 600 605 

Asp Arg Trp Cys Glu Tyr He Met Tyr Arg Gly Leu He Arg Leu Gly 
610 615 620 

Tyr Ala Arg He Ser His Ala Glu Leu Ser Asp Ser Glu He Gin Met 
625 630 635 640 

Ala Lys Phe Arg He Pro Asp Asp Pro Thr Asn Tyr Arg Asp Asn Gin 
645 650 655 

Lys Val Val He Asp His Arg Glu Val Ser Glu Lys He His Phe Asn 
660 665 670 

Pro Arg Phe Gly Ser Tyr Lys Glu Gly His Asn Tyr Glu Asn Asn His 
675 680 685 

Asn Phe His Met Asn Thr Pro Lys Tyr Phe Leu 
690 695 
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OCl 2 5 2SHtt 1 

s 



TABLE 25: 



Table 25 depicts SEQ ID NO:, UnigenelD, UnigeneTitle, Pkey, and ExAccn for all of the 
sequences in Table 26. SEQ ED NO: links the nucleic acid and protein sequence information 
in Table 26 to Table 25. 



Pkey: 


Unique Eos probeset Identifier number 




ExAccn: 


Exemplar Accession number, Genbank accession number 




UnigenelD: Unlgene number 




Unigene Title: Unigene gene title 




SEQ ID NO: SEQ ID NO: found in Table 26 




Pkey 


ExAccn 


UnigenelD UnigeneTitle 


SEQ ID NO: 


426101 


AL049987 


Homo sapiens mRNA; cDNA DKF2p564F112 (fr 1-5 


419145 


N99638 


gb 


6&7 


426818 


AA554827 


Hs.340046 DKFZp434A0131 protein 


8&9 


421057 


T58283 


Homo sapiens cDNA 


10 


446619 


AU076643 


Hs.313 secreted phosphoprotein 1 (osteopontin, 


11&12 


431958 


X63629 


Hs.2877 cadherin 3, type 1 , P-cadherin (placenta 


13&14 


409041 


AB033025 


Hs.50081 Hypothetical protein. XP_051860 (KIAA119 


15&16 


443162 


T49951 


Hs.9029 DKFZP434G032 protein 


17&18 


436385 


BE551618 


Hs. 144097 ESTs 


19-21 


447033 


AI357412 


Hs.157601 ESTs 


22&23 


439608 


AW864696 


Hs.301732 hypotheUcal protein MGC5306 


24-28 


449032 


AA045573 


Hs.22900 nuclear factor (eiythrokl-derived 2Hik 


29&30 


442577 


AA292998 


Hs. 163900 ESTs 


31&32 


429970 


AK000072 


Hs.227059 chloride channel, calcium activated, fam 


33&34 


424566 


f\^16801 


Hs.1790 nuclear receptor subfamily 3. group C, m 


35&36 


457407 


AA505035 


Hs.345911 ESTs 


37 


430378 


Z29572 


Hs.2556 tumor necrosis factor receptor superfami 


38&39 


417332 


AW972717 


Hs.288462 hypothetical protein FU2151 1 


40&41 



Table 26 



SEQ ID NO : 1 DNA sequence 

Nucleic Acid Accession #: see Table 25 & 25A for complete list 

1 11 21 31 41 51 

I I I I I I 

CAATATAGTA CAATAACTAT TTGCATGACA TTTACATCGG ATATTATGAG TGATCTAGAG 60 

TTGATATGAA GTATATGGGA GGATGTGCAA AGGTGATGTG CAAATACTAT GTCATTTTAT 120 

AGGGGGGACT TGAGTATCCT TTGTTACCCT CAGGAGATCC TGAAACCAGT CCCCCATGGA 180 

TACTGAGGGC TGACTGTATA GTCCTATCCT CACGGAACTT TCATTCTAAT GGGGGAAGAC 24 0 

TGACTATAAA CAAAATATAT GTAATAGGTG GTGGTAAGTA CCGTGGAGAA GTAACTU^TG 3 00 

GGGCAAAGTG AGTTATACAG CTCCATTCTT AGAAACCTTG GAGTACTTTT CTTAGTTTAT 3 60 

ACTCGTGGTG GTTTCCTTTT GTCTCCTTTA TTACATGGGA CTCTGACATG TGCCCATAGC 420 

TAGGGTGACA GTAGGATCTA CCCGATAGTA GGGTGGCAGT AGGATCTACC CAAAAAGCGT 4 80 

CCTGCTGATA CAGGACCAAA GCATCCTGTT GTTCTCGAGC CTATAAAAAG AGCTAATGGT 54 0 

GTTGCTTCTC TTAACTGTGG CCTCCTACAC TGTGTTTTGG ATGATTGGTG ATGTCTTGGA 600 

TATTCTGTTT CTTTGGAACT TTGAATATAC TUVCACTTTAC TAGGGAATTA GCAATGGAAG 660 

CAGAGCAAAG ATGTACAGAG GAAACAATGC GTAACTCTGA TGGAATTGAA GTCATGAGGC 720 

AGCAGAGAGC TTAAATTACA GCTTTAAAAA TTTTTATTTT TTAGAGGGAA TTTACTTGGG 7 80 

AGTAACAGCA GTAATAGTTA ACGGAGCCAG AATGCTTGAG TCATATAATT GCAAAGCAGA 84 0 

GTTGGGAGCA ACAGATGCTA AAGAGTAGTT GCTGTAGTTC CTCTTTGGGT CGTAGGAGCA 900 

GTTGTCATAT TACTATATAG CTACTGCATG AAGAAGAGTT CTTAGTGAGG CCTGGGTGAA 960 

CAGCTCTTCT TAGTATTCTG TGTGACCCCA TTTGACCTTT TAACAAATCC CTAAGTAAAT 102 0 

AAATAGCCCC TCAGGAAAAC TAAGTTTTTC TCTGCTGTTT TTTTGCTTGA GAGAGCTATA 1080 

ACTGTAATAG ACTTATATTT CTGAACATTT TAGTGCTTGC CAATATTTGG TAATATTTAT 114 0 

GTTTCCTATA TTTGTAATGA ACATTCTTCT TCCGGTACAT TTTTTGTTAA ATTATTGTTT 12 00 

GATGGATAAA AGTTCACCTT TTATTGTATA AAATTGACTG AGATTAATTT ATACACATTG 12 60 

ACAATGGGTA AATAGAATTT TTCAGATTAT TAAAAGCTGA AGGATGACCA CGTAAGCAAA 13 2 0 

AAAAAAAAAA AAAAAACCAA CAAAAATAAA CCCAAACCCC TCAAACAATT TCGAACACGA 13 80 

AACATTCTTC TGATGCCGGC ATCCCTGCTT GCAGGTGTGA AGGGGGCAGG AATCAGCGAG 144 0 
GTGTCCTGGG CTGAGTCCCC GGGGAAGAAT ATGAT 

SEQ ID NO: 2 DNA sequence 

Nucleic Acid Accession #: X83301.1 

1 11 21 31 41 51 

I I I I I I 

GCAAAGCCAG CTGGGCTCCT GAGTCCGGTG GGTACTTGGA GAACTTACTA CGTCTAGCTG 60 

GAGGATTGTA AATGCACCAA TCAGCATGCT GTGTCTAGCT CAAGATTTTC TCCATCCCCT 12 0 

TATTTTGGGC CAGTGGCTGT CATTACATAT GAGATGAGTC TCTTGAAGAC TACAGATGAA 180 

CTCAAGCTCC ATGAGGAGAT GTTTCATTGT CGAGAGCAGT CATGATGGCC TGCACTCCAC 240 

ACAATGCAAC AGAGTGAAAG AGCAGGTTCT GCTTCTTTGG TGTAGTCCTG AAGCTTCCTA 3 00 

AGAAACTTCA CATCAGGTGA TGGATAGGAG CAACCCTGTA AAACCAGCCT TAGACTATTT 3 60 

TTCAAACAGG CTGGTGAATT ACCAGATCTC CGTCAAGTGC AGTAACCAGT TCAAGTTGGA 420 

AGTGTGTCTT TTGAATGCAG AGAACAAAGT CGTGGACAAC CAGGCTGGGA CCCAGGGCCA 4 80 

GCTGAAGGTG CTGGGTGCCA ACCTCTGGTG GCCGTACCTG ATGCACGAAC ACCCCGCCTA 54 0 

CCTGTACTCC TGGGAGGATG GTGATTGCTC ACACCAAAGC CTTGGACCCC TCCCAGCCTG 600 

TGACCTTTGG GACCAACTCC ACCTACGCAG CAGACAAGGG GGCTCTGTAT GTGGATGTGA 660 

TCCGTGTGAA CAGCTACTAC TCTTGGTATC GCAACTACGG GCACCTGGAG TTGATTCGGC 72 0 

TGCAGCTGGC CGCCCAGTTT GAGAATTGGT GTGAGACATC ACAATCCCAT TATTCAGAGC 780 

GCGTATGGAG TGGAAACGCT TGTAGGGTTT CACCAGGGCT GGTGAATTAC CAGATCTCCG 84 0 

TCAAGTGCAG TAACCAGTTC AAGTTGGAAG TATGTCTTTT GAATGCAGAA AACAAAGTCG 900 

TGGACAACCA GGCTGGGACC CAGGGCCAGC TGAAGGTGCT GGTGCCAACC TCTGGTGGCC 960 

GTACCTGATG CACGAACACC CCGCCTACCT GTACTCGTGG GAGGATGGTG ATTGCTCACA 102 0 

CCAAAGCCTT GGACCCCTCC CAGCCTGTGA CCTTTGGGAC CAACTCCACC TACGCAGCAG 1080 

ACAAGGGGGC TCTGTATGTG GATGTGATCC GTGTGAACAG CTACTACTCT TGGTATCGCA 114 0 

ACTACGGGCA CCTGGAGTTG ATTCGGCTGC AGGCCCTGCA GCTGGCCGCC CAGTTTGTGA 12 00 

ATTGGTGTAA GACATCACAA TCCCATTATT CAGAGCGCGT ATGGAGTGGA AACGCTTGTA 12 60 



GGGTTTCACC AGTCTTTCCC AGGGAACTCC GATGAAGTGT TCCAACAAAA TGAGCGAGTG 1320 

AACCAAGAAG AGGATGACAT TAGATCCAGG AGATACAACA GAGGAGATAA TCTCCAGGAT 1380 

GCCTGTGAAG AAAGATCCCT GGATCCCAGG ATGATTATAG GACAAGTTGT TCATAATCCA 144 0 

GCAGGCCAGA AGACTTCCAG GGAAACTCAT TTCAAGATGA AAATGGACCA GCCGCAGTGG 1500 

CTCACGCCTG TAATACCAGC ACTTTGGGAG GCTGAGGCGG GCGGATCACT TGAGGTCAAG 1560 

AGTTTGAAAC TAGCCTGGCC AACGTGGCAA AACTCCATCT CTATTAAAGA TACAAAAATT 162 0 

AGCCAGGCAT AGTGGTGCAT GCCTGTAGTC CCAGCTACTT GGGATGCTGA GGCAGGAAGA 1680 

ATTGCTTGAA CCTGGGAGGC AGAGTCTGCG GTGACCGAGA TCATGCCACT GCACTCCAGC 1740 
CTGGGTGACA GAGCCAGACT CCGTCTCTAC TAAAAAAAAA AAAAAAAAAA AAA 



SEQ ID NO : 3 Protein sequence : 
Protein Accession #: CAA58280.1 

1 11 21 31 41 51 

I I I I I I 

MDRSNPVKPA LDYFSNRLVN YQISVKCSNQ FKLEVCLLNA ENKWDNQAG TQGQLKVLGA 
NLWWPYLMHE HPAYLYSWED GDCSHQSLGP LPACDLWDQL HLRSRQGGSV CGCDPCEQLL 
LLVSQLRAPG VDSAAAGRPV 



SEQ ID NO: 4 DNA sequence 

Nucleic Acid Accession #: BC002622.1 

1 11 21 31 41 51 

I I I I I I 

GGCACGAGGC TCCGCCCGCG GCCGGGATGC ACTAGGCAAA GCCAGCTGGG CTCCTGAGTC 60 

CGGTGGGTAC TTGGAGAACT TACTACGTCT AGCTGGAGGA TTGTAAATGC ACCAATCAGC 12 0 

ATGCTGTGTC TAGCTCAAGA TTTTCTCCAT CCCCTTATTT TGGGCCAGTG GCTGTCATTA 180 

CATATGAGAA CTCAAGCTCC ATGAGGAGAT GTTTCATTGT CGAGAGCAGT CATGATGGCC 240 

TGCACTCCAC ACAATGCAAC AGAGTGAAAG AGCAGGTTCT GCTTCTTTGG TGTAGTCCTG 300 

AAGCTTCCTA AGAAACTTCA CATCAGGTGA TGGATAGGAG CAACCCTGTA AAACCAGCCT 360 

TAGACTATTT TTCAAACAGG CTGGTGAATT ACCAGATCTC CGTCAAGTGC AGTAACCAGT 42 0 

TCAAGTTGGA AGTGTGTCTT TTGAATGCAG AAAACAAAGT CGTGGACAAC CAGGCTGGGA 4 80 

CCCAGGGCCA GCTGAAGGTG CTGGGTGCCA ACCTCTGGTG GCCGTACCTG ATGCACGAAC 540 

ACCCCGCCTA CCTGTACTCG TGGGAGGATG GTGATTGCTC ACACCAAAGC CTTGGACCCC 600 

TCCCAGCCTG TGACCTTTGT GACCAACTCC ACCTACGCAG CAGACAAGGG GGCTCTGTAT 660 

GTGGATGTGA TCCGTGTGAA CAGCTACTAC TCTTGGTATC GCAACTACGG GCACCTGGAG 72 0 

TTGATTCAGC TGCAGCTGGC CGCCCAGTTT GAGAATTGGT GTAAGACATC ACAATCCCAT 780 

TATTCAGAGC GCGTATGGAG TGGAAACGCT TGTAGGGTTT CACCAGTCTT TCCCAGGGAA 840 

CTCCGATGAA GTGTTCCAAC AAAATGAGCG AGTGAACCAA GAAGAGGATG ACATTAGATC 900 

CAGGAGATAC AACAGAGGAG ATAATCTCCA GGATGCCTGT GAAGAAAGAT CCCTGGATCC 960 

CAGGATGATT ATAGGACAAG TTGTTCATAA TCCAGCAGGC CAGAAGACTT CCAGGGAAAC 1020 

TCATTCAAGG AGGTGi^AAAT GATGGATGAC TCCTCCAAGA TGAAAATGGA CCAGCCGCAG 1080 

TGGCTCACGC CTGTAATACC AGCACTTTGG GAGGCTGAGG CAGGCGGATC ACTTGAGGTC 114 0 

AGGAGTTTGA AACTAGCCTG GCCAACGTGG CAAAACTCCA TCTCTATTAA AAATACAAAA 12 00 

ATTAGCCAAG CATAGTGGTG CATGCCTGTA GTCCCAGCTA CTTGGGATGC TGAGGCAGGA 1260 

AGAATTGCTT GAACCTGGGA GGCAGAGTCT ACAGTGAGCC GAGATCATGC CACTGCACTC 1320 
CAGCCTGGGC AACACAGTGA GACTCCATCT CAAAAAAAAA AAAAAAAAAA AA 



SEQ ID NO: 5 Protein sequence: 
Protein Accession #: AAH02622.1 

1 11 21 31 41 51 

I I I I I I 

MDRSNPVKPA LDYFSNRLVN YQISVKCSNQ FKLEVCLLNA ENKWDNQAG TQGQLKVLGA 60 
NLWWPYLMHE HPAYLYSWED GDCSHQSLGP LPACDLCDQL HLRSRQGGSV CGCDPCEQLL 120 
LLVSQLRAPG VDSAAAGRPV 



SEQ ID NO: 6 DNA sequence 

Nucleic Acid Accession #: see Table 25 & 25A for complete list 



1 11 21 31 41 51 

I I I I I I 

ACCTGAGATC AGGAGTTCGA GATCAGCCTG ACCAATAGGG TGAAACCCCG TCTCTACTAA 60 

AAATACAAAA AATTAGCTGG ACACGATGGT GGGTGCCTGT GGTCCCGGCT ACTCGGGAGG 120 

CTGAGACAGG AGAATCAGTT GACCTGGGAG TTGGTGGTTG CAGTGAGCTG AGATCACACC 180 

ATTGCATTCC AAGCCTGGGC AACAAGAGTG AAACTCCATC GCAAAAAAAA AAAAGAAGGG 24 0 

GCATAATTTG TGGATGAGGA TTGGATATAA GGTAAAGGAT GGGACATTCT TGGACTTACA 300 

GATGGTGTGA TTGCCTGGCT AGAAGAAGAA TTCCCGGTCA AAAAGAAACC ATCAGCTTTC 360 

CT^GTGTGAA AGAGAGATAA ATCTGTGAAG ATTATAGGGA CTACAGGAAA CTTAATCTTT 42 0 

TTCTTTGAAA AAGCAATTGT AGCAAAAAAA AAGAAAATTT CTTACTGTCA TCTAAAATTG 4 80 

ACATGGACAT CTTAGTGGAC TAGAAGTTAA GGGCATAAAT TCTCCCAGTG ATTTTTAATT 54 0 

TTAGCATTGT GATTAACACC TTCTAAAATT GCCAGAACTT AATAAATAAT TGCTTTTCAT 60 0 

TATTAGTATG CCATCAAATT TAGTAGCTGT TTCAGGCTTT AATGTGTCAA GCCTAAAATC 660 

CAGATTTTTG AGGATCTTCT CCCTCTTAAA AGAGTATTCA GTTAACTGCC GTAGAAATAC 72 0 

ACATGTATAC AAGGGCACTG TATACATCAG TCTAAAAAAT AAAAATATGT ATACGTTCTG 78 0 

GTGAGTCTAG CACAGCATTG CCCAATAGAA ATACCAATGG AGGTCACAAA TGTGGCCCAT 84 0 
ATAGGTTAAT TGGTAAATTT TCTNATAGNC ACC 



SEQ ID NO : 7 DNA sequence 

Nucleic Acid Accession #: AK000942 

Coding sequence: 12 04-1503 

1 11 21 31 41 51 

GTAAAGGAAT GTCTTTTTAA TTCAGCTTTT CTTTTCTCCA TGCTAGTGTT ATCAGGTTTT 6 0 

GGTATTTATT TACTTACAGC ATATGTTATG AAGCTGGTTT GAAAATTGGT TTTAGATATA 12 0 

TCTGCAAGTT TACTACTTTG ACTGTAAAAA AAAAAAATGA AAAAGTAGTT GACATCTGTC 18 0 

CTCAGAAGAA GTTTGCAGGT TGCATATTTG TGTGTAAATA CACAGGCTAA AAGGTAATTT 24 0 

ATGTTCCTTG GGAATTGAAA TGGTCAGTGG CCCGTTACAG AAACTTATCA GTCATATATC 3 00 

AGCACCAGTT CATTCTTTTG CACCTTAGGG ACCATCTGTC CCCTGAGGTG ACCTGAGAAA 360 

CAACCAGTTG CCCACAGACT GTTATTTCTT CAAGTGAGCC AGGATTTGAT TTCACTGCCT 42 0 

TATATTCTAT TTTTAGTGTA CAGTGCTTTG ATTTTTTGGA AAAACTAAAT TTTAAACATA 480 

TTTGAAAAAT GTTATAAGAC TTGGACATTA AGTCTGTTGA TAGCCAAAGT CAGTTTACCA 54 0 

AAGTAAAACA AATAAATTCT ATGCTTCTTC ATTGTCAAAG AGCAGTCTGC CATCATGTGG 60 0 

ATATAAATGG ACTATGTAAA GTGACATGGT GCTTACTCTC TACCTAATAA TAGCCTCCCT 660 

CCTGTTCCAA CAAGATAACC AACAGGTATA TTTAATTTAC CAGTTAATAT GTTTTGGATA 72 0 

ATTGGCTGCC TTGAAATGCT ATATGTTTTA TAGTACATCA TAGCTTTAGT TTTCTTCATA 780 

AGGAAATTAC AGTTACATCC TGGCTAACAT GGTGAAACTC CATCTCTACT AAAAATACAA 84 0 

AAAATTAGCC GGGCGTGGTG GCGGGCACTT GTAGTCCCAG CTACTCGGGA GGCTGAGGCA 90 0 

GGAGAATGGC GTGAACCCAG GAGGCGGAGG TTGCAGTGAG CCGAGATCGT GCCACTGTAC 96 0 

TCTGGCCTGG GAGACAGAGC GAGACTCCAT CTCAAAAAAA AAAAAAAAAA AAAAAAAAGA 102 0 

GAGAGAGAGA CCTGGAGTAG AGATTCTGTC AAAGAACTTT TTCTTTCTTG AGAAGCATCT 108 0 

GAAATGGAAT CTGTTGTCTC TTCGAAATAT GTACTGCTGT AACAGTGAAA CAACCCTCAG 114 0 

AGTATGCCTT CGTGTGGGCT ACTCGTTGTG GTTTTGAACT TGGGGGAACT GTCTGTGTTT 12 0 0 

GGGTCAAGAA TATGCAACTG GCTGGGCACA TTGGCTCACG CCTGTAATCC CAGCAATTTG 1260 

GGAGGCTGAG GCAGGCGGAT CACCTGAGGT CAGGGCTTCA AGACCAGACT GGCCAACATG 132 0 

GTGAAACCCC GTCTCTACTG AAAATACAAA AATTAGCTGG GCATGGTGGC AGGTGCCTGT 1380 

AATCCCAGCT ACTCGGGAGG CTGACGTGAG AGAATCGCTT GAACCCGGGA GTTGGAGGTT 144 0 

GCAGTGAGCC GAGATTGCAC CATTGCACTC CAGCTTGGGC AACAAGAGTG AAACTCTTGT 1500 
CTCAG 



SEQ ID NO : 8 DNA sequence 

Nucleic Acid Accession #: see Table 25 & 25A for complete list 

1 11 21 31 41 51 

I I I I I I 

GACTAGGCTG GGCAACATAG TGAGACCTCA TCTCTAAAAT TAAAAAAATA AAAGCCACCA 60 

GAAAAAAACC TAAAAACATG CCAAGTGACA TCAGTCTTTG ATGAAAATGG CAGCAGAAGA 120 

GTGATGCCAT GGGTGGGGGT GGGAAATGCT ATTTCAGCAG AGAGGGAGCT GTCATGGAAG 180 

ACACCATGTG GCTGGGCACG GTGGCTCACA CCTGTAATCC CAGCACTTTG GGAGATAGAG 24 0 

GCAGGTGGAT CCCTTGAGCT TAGGAATTTG AGACTAGCCT GGGCAATAAG AGTGAAACTC 3 00 

CATCTCAAAA AAAAAAAAAA AAAAAGGTGC ATGAAACATA TGAAGCAAAA AGTGAAAGTC 360 

CCCATTCTTT TCCTTTTTCC AGAGGTGATT TTTGTGGCCA ATCTGGTTTC ATTCCCTCCC 420 

AGACACTTTT CTAGGCATCT ATGCGCCTCT ATTCACATAT AAACAAAATA GGAGTTTTCC 4 80 

TGTGCTTCCC TTAAATGGCA TATGTATCTT TCACTCTTTT TTTTCACCTA GTGGATCTTT 54 0 

AATACCTTAA AAGCTCAACC TGGGCTTGGT GCGGTGGCTC ATACGTGTAA TCCCAGGCCT 60 0 

TTGGGAGGCC AAGGTGGGAG GATCACTTGA GCTCAGGAGT TCCAGACCAT TCCAAAGCAA 660 

AAACAAAAGG ATTTTGAGAT CAGTGTGGGC AACTTAGCAA AACACCATCT CTTAAAAAAA 720 
AAAAAAAAAA 



SEQ ID NO: 9 DNA sequence 

Nucleic Acid Accession #: BC010433.1 
Coding sequence: 3-335 

1 11 21 31 41 51 

I I I I I I 

GGTCGCCCTC CGTCGTGGTC TGGCGTGTAT TCCGAGCCTT GGTGTCTGGC GGTTTCCGAG 60 

CGTTGGTGTC TGGCGGTTTC CGAGCGTTGG TGTCTGGCGG TTTCCGACCG TTGGTGTCTG 120 

GCGGTTTCCG ACCGTTGGTG TCTGGCACGC GCCACCCTCT CTTGCTTTGG TTGCGCCATG 180 

CCGATGTACC AGACAAGAAG ACAAGAAAAT GATTTGAGGA CAGCTTCAAT CGCGGTGTGA 240 

AGAAGAAAGC AGCAAAACGA CCACTGAAAA CAACGCCGGT GGCAAAATAT CCAAAGAAAG 3 00 

GGTCCCAAGC GGTACATCGT CATAGCCGGA AACAGTCAGA GCCACCAGCC AATGATCTTT 360 

TCAATGCTGC GAAAGCTGCC AAAAGTGACA TGCAGCACCG AGAAGTCCGC GTGAAGTGCG 42 0 

TGAAGGCTCT GAAAGGGCTG TACGGTAACC GGGACCTGAC CGCACGCCTG GAGCTCTTCA 4 80 

CTGGCCGCTT CAAGGACTGG ATGGTTTCCA TGATCATGGA CAGAGAGTAC AGTGTGGCAG 540 

TGGAGGCCGT CAGATTACTG ATACTTATCC TTAAGAACAT GGAAGGGGTG CTGATGGACG 600 

TGGACTGTGA GAGCGTCTAC CCCATTGTGT AGGCCTCTAA TTGAGGCCTG GCCTCTGCTG 660 

TGGGTGAATT TCTGTACTGG AAACTTTTCT ACCCTGAGTG CGAGATAAGA ACGATGGGTG 72 0 

GAAGAGAGCA ACGCCAGAGC CCAGGTGCCC AGAGGACTTT CTTCCAGCTT CTGCTGTCCT 7 80 

TCTTTGTGGA GAGCAAGCTC CACGACCACG CTGCTTACTT AGTAGACAAC CTGTGGGACT 840 

GTGCAGGGAC TCAGCTGAAG GACTGGGAGG GTCTGACAAG CCTGCTGCTG GAGAAGGACC 900 

AGAGCACGTG CCACATGGAG CCAGGGCCAG GGACCTTCCA CCTCCTAGGG TGAAACCAGG 960 

AGAGATTGCT TGCTTCACTT GTACAAGGCA GGAACGGTGG CATGGGGTGG GGGAAACTTG 102 0 

GAGTTGGAAG GTGGCTAATC TTTGATTCTA TGTTTTTGAT CCTCCTGGCA CTCCAGACCT 1080 

GGGTGATGTG CAGGAGAGCA CACTGATAGA AATCCTTGTG TCCAGTGCCC AGCAACTCCT 114 0 

GCCTCAGCCT CCCGAGCAGC TGGGACTACA GGCGCCCGCC ACCACGCCTG GCTAACTTTT 1200 

TTGTGTTTTT AGTAGAGACG GGTTTTCACC GTGTTGGCCA GGATGGTCTT GATCTCTTGA 1260 

CCTTGTGATC CACCTGCCTC ATCATCCCAA AGTGCTGGGA TTACAGGCGT GAGCCACTGC 132 0 

GCCCAGCATG TTAGACAATT TTTAATTCAT CCTCTCTGTG CTGTTGTTTT CTCAGCTGTG 13 80 

AAAGGAATAT TCTGGTGGGG ACAAGGTTAC AGAGTTGCTG AGAGGGTCTC ATGACATGAA 1440 

GGTACTGGCC TTGGCACAGT GCCTGGGGGG GCGGGGACTC CGCACATGCC TGTGATGTCA 1500 

CAGTTACTGT CAGTTCACAG CGAACCTTCC CTCCTTTTCC TGTTGACTTT CCCACACTCC 1560 

TGTAACCCTC CCTCCCTCCC TTCTTCCTCT CTCTCTCTCT CACTCACGCA CACGCACACA 162 0 

CACACACACA CACACACACA CACACACTCC ATTCACTGTC TCCATGACTC TGGAGTAAAC 16 80 

TAACGTCTCG AGTTGCCATT GGAAGCCCCG TTGTCCTCAT TTAGACTTTC ATGGGTTATA 1740 

GGCACTTTTG ACTTCCTGGG GTCCTTCTTC AGTT/^AAAAA AT^TW^TTAGA AAATTAGGCC 1800 

GGGCGTGGTG GCACATGCCT GTAATCCCAG CACCTTGGCC TCCCAAAGTG CTGGGATTAC 1860 

AGGAGTGAGC CACCATGCCC AGCCTCCGTT GTCCTCATTT AGACTTTCAT GGGTTATAGG 1920 

CACTTTTGAC TTCCTGGGGT CCTTCTTCAG TTAAAAAAAA AAAAAAAAAA 



SEQ ID NO: 10 DNA sequence 

Nucleic Acid Accession #: see Table 25 & 25A for complete list 



1 11 21 31 41 51 

I I I I i I 

AGTGGNTCCC CCGGNCTGCA GGAATTCGGC ACGAGATCAT GATGGCTAAT ATTTCCTGAG 60 

CACCTTTCAT TCAGGCATGA TGCCAGGTGC ACCAACTTAC TTAATCCTCA TAGCCACCAC 120 

CTGAGCAAGC TCCTGTTTTA TAAATGGACC AGTTCTTGTT GCTGTTGTAC AAGTTATTTT 180 

CTTTCTATAA CGTCCTCCTT GTCCTCCTTC CACATTCTTA AAGAAACTTT CCCTTCCTTT 24 0 

AAAGTACTCA GGGAGCCCTG CATTGCTTCT TGAAGCCTTC TCCAGCTTCA TCATCTCACA 3 00 

GTGGTCTCTC TTTTCACTAA ATGTCCAATA TGCTGCACAT AAGTACCCCA AAGTTAGCAC 3 60 

AGGAATTGTT CCATGGCTGT CATATATGTT AAAAATCATT AAAAGTTCAT TTTTTCTCTC 42 0 

ATTATGGGAA GGATACATGC TCCTACTAGT AAATTTAGTA GGTAGAAAAA AATTATCACT 4 80 

ATCTAGACTG CTTTCCATTT AGTCTTTATG CATAGCTTTC GTGTCTGCCT ATTTTTACCT 54 0 

TGTGTTTGTA ACTTACTATT ATAAAATATG CGTCTCTATG TTCATTGTCA ACGATTATTT 600 

ACAATAACAT GGAGTGGATT TACATGTATT CTCTATATTT GGATTAAAGG AGATAGAGTA 660 

TGTGAAATTA AATGGGAGAA GTATCTGATA CATAACAGGC AATACAAATA TTATCACATA 72 0 

GCGTCAATTT ATTTGTGAAT ATTGAAAGCT CCAAAAAAGA AAAAAAGTTT TTTTTTAATT 7 80 

CCCGTAATTA CTTATTGCAG TATTGTGTTC ATACAAACTG CTCAGTCATT TTGGAGAAAT 84 0 

AACAATTTTT TTCCTCATCA TGAAGTAAGG TATGCTCACT GCAAAAAAAA TCTAGAAAAT 900 

AAAGAGGAAC ATGCTAAAGA AAAGAATACT CCCATATAAT CTCTGTCTTC ATAAATAATC 960 

TTTTGTAACG CTTATACACT GCTGGTGGGA ATGTAAATTA GTTCAGCCAT TGTGTU^AAGT 102 0 

AGCGTAGCAA TTCCTTGAAA AACTTAAAAT AGATTTACCG TTCAACCCAG CAATCCCATT 1080 

ATTGGGCATA TACCCAGTGG AATGTAAATC ATCCTGCCAT AAAAACACAT GCACATGTAT 114 0 

GTTCATTGCA GCACTATTCA CAATAGCAAA GACATGGAAT CAACCTATAT GCCCATCAAT 12 00 

AGTAGACTGA ATAAAGAAAA TATGGTACAT ATTCACCACA GAATACTAAG CAGCCATAAA 1260 
AAAAAA 



SEQ ID NO: 11 DNA sequence 

Nucleic Acid Accession #: NM_000582.1 
Coding sequence: 88-990 

1 11 21 31 41 51 

I I I I I I 

GCAGAGCACA GCATCGTCGG GACCAGACTC GTCTCAGGCC AGTTGCAGCC TTCTCAGCCA 60 

AACGCCGACC AAGGAAAACT CACTACCATG AGAATTGCAG TGATTTGCTT TTGCCTCCTA 12 0 

GGCATCACCT GTGCCATACC AGTTAAACAG GCTGATTCTG GAAGTTCTGA GGAAAAGCAG 180 

CTTTACAACA AATACCCAGA TGCTGTGGCC ACATGGCTAA ACCCTGACCC ATCTCAGAAG 24 0 

CAGAATCTCC TAGCCCCACA GACCCTTCCA AGTAAGTCCA ACGAAAGCCA TGACCACATG 3 00 

GATGATATGG ATGATGAAGA TGATGATGAC CATGTGGACA GCCAGGACTC CATTGACTCG 360 

AACGACTCTG ATGATGTAGA TGACACTGAT GATTCTCACC AGTCTGATGA GTCTCACCAT 42 0 

TCTGATGAAT CTGATGAACT GGTCACTGAT TTTCCCACGG ACCTGCCAGC AACCGAAGTT 4 80 

TTCACTCCAG TTGTCCCCAC AGTAGACACA TATGATGGCC GAGGTGATAG TGTGGTTTAT 54 0 

GGACTGAGGT CAAAATCTAA GAAGTTTCGC AGACCTGACA TCCAGTACCC TGATGCTACA 600 

GACGAGGACA TCACCTCACA CATGGAAAGC GAGGAGTTGA ATGGTGCATA CAAGGCCATC 660 

CCCGTTGCCC AGGACCTGAA CGCGCCTTCT GATTGGGACA GCCGTGGGAA GGACAGTTAT 72 0 

GAAACGAGTC AGCTGGATGA CCAGAGTGCT GAAACCCACA GCCACAAGCA GTCCAGATTA 7 80 

TATAAGCGGA AAGCCAATGA TGAGAGCAAT GAGCATTCCG ATGTGATTGA TAGTCAGGAA 84 0 

CTTTCCAAAG TCAGCCGTGA ATTCCACAGC CATGAATTTC ACAGCCATGA AGATATGCTG 900 

GTTGTAGACC CCAAAAGTAA GGAAGAAGAT AAACACCTGA AATTTCGTAT TTCTCATGAA 960 

TTAGATAGTG CATCTTCTGA GGTCAATTAA AAGGAGAAAA AATACAATTT CTCACTTTGC 102 0 

ATTTAGTCAA AAGAAAAAAT GCTTTATAGC AAAATGAAAG AGAACATGAA ATGCTTCTTT 1080 

CTCAGTTTAT TGGTTGAATG TGTATCTATT TGAGTCTGGA AATAACTAAT GTGTTTGATA 114 0 

ATTAGTTTAG TTTGTGGCTT CATGGAAACT CCCTGTAAAC TAAAAGCTTC AGGGTTATGT 12 00 

CTATGTTCAT TCTATAGAAG AAATGCAAAC TATCACTGTA TTTTAATATT TGTTATTCTC 1260 

TCATGAATAG AAATTTATGT AGAAGCAAAC AAAATACTTT TACCCACTTA AAAAGAGAAT 132 0 

ATAACATTTT ATGTCACTAT AATCTTTTGT TTTTTAAGTT AGTGTATATT TTGTTGTGAT 13 8 0 

TATCTTTTTG TGGTGTGAAT AAATCTTTTA TCTTGAATGT AATAAGAATT TGGTGGTGTC 1440 

AATTGCTTAT TTGTTTTCCC ACGGTTGTCC AGCAATTAAT AAAACATAAC CTTTTTTACT 1500 
GCCTAAAAAA AAAAAAAAAA AAAA 



SEQ ID NO: 12 Protein sequence: 
Protein Accession #: NP 000573.1 



1 11 21 31 41 51 

I I I I I I 

MRIAVICFCL LGITCAIPVK QADSGSSEEK QLYNKYPDAV ATWLNPDPSQ KQNLLAPQTL 60 

PSKSNESHDH MDDMDDEDDD DHVDSQDSID SNDSDDVDDT DDSHQSDESH HSDESDELVT 120 

DFPTDLPATE VFTPWPTVD TYDGRGDSW YGLRSKSKKF RRPDIQYPDA TDEDITSHME 180 

SEELNGAYKA IPVAQDLNAP SDWDSRGKDS YETSQLDDQS AETHSHKQSR LYKRKANDES 24 0 

NEHSDVIDSQ ELSKVSREFH SHEFHSHEDM LWDPKSKEE DKHLKFRISH ELDSASSEVN 



SEQ ID NO: 13 DNA sequence 

Nucleic Acid Accession #: NM_001793 

Coding sequence: 71-2560 

1 11 21 31 41 51 

I I I I I I 

AAAGGGGCAA GAGCTGAGCG GAACACCGGC CCGCCGTCGC GGCAGCTGCT TCACCCCTCT 60 

CTCTGCAGCC ATGGGGCTCC CTCGTGGACC TCTCGCGTCT CTCCTCCTTC TCCAGGTTTG 12 0 

CTGGCTGCAG TGCGCGGCCT CCGAGCCGTG CCGGGCGGTC TTCAGGGAGG CTGAAGTGAC 180 

CTTGGAGGCG GGAGGCGCGG AGCAGGAGCC CGGCCAGGCG CTGGGGAAAG TATTCATGGG 240 

CTGCCCTGGG CAAGAGCCAG CTCTGTTTAG CACTGATAAT GATGACTTCA CTGTGCGGAA 3 00 

TGGCGAGACA GTCCAGGAAA GAAGGTCACT GAAGGAAAGG AATCCATTGA AGATCTTCCC 360 

ATCCAAACGT ATCTTACGAA GACACAAGAG AGATTGGGTG GTTGCTCCAA TATCTGTCCC 420 

TGAAAATGGC AAGGGTCCCT TCCCCCAGAG ACTGAATCAG CTCAAGTCTA ATAAAGATAG 480 

AGACACCAAG ATTTTCTACA GCATCACGGG GCCGGGGGCA GACAGCCCCC CTGAGGGTGT 540 

CTTCGCTGTA GAGAAGGAGA CAGGCTGGTT GTTGTTGAAT AAGCCACTGG ACCGGGAGGA 600 

GATTGCCAAG TATGAGCTCT TTGGCCACGC TGTGTCAGAG AATGGTGCCT CAGTGGAGGA 6 60 

CCCCATGAAC ATCTCCATCA TCGTGACCGA CCAGAATGAC CACAAGCCCA AGTTTACCCA 720 

GGACACCTTC CGAGGGAGTG TCTTAGAGGG AGTCCTACCA GGTACTTCTG TGATGCAGGT 780 

GACAGCCACG GATGAGGATG ATGCCATCTA CACCTACAAT GGGGTGGTTG CTTACTCCAT 840 

CCATAGCCAA GAAGGAAAGG ACCCACACGA CCTCATGTTC ACCATTCACC GGAGCACAGG 900 

CACCATCAGC GTCATCTCCA GTGGCCTGGA CCGGGAAAAA GTCCCTGAGT ACACACTGAC 960 

CATCCAGGCC ACAGACATGG ATGGGGACGG CTCCACCACC ACGGCAGTGG CAGTAGTGGA 1020 

GATCCTTGAT GCCAATGACA ATGCTCCCAT GTTTGACCCC CAGAAGTACG AGGCCCATGT 1080 

GCCTGAGAAT GCAGTGGGCC ATGAGGTGCA GAGGCTGACG GTCACTGATC TGGACGCCCC 114 0 

CAACTCACCA GCGTGGCGTG CCACCTACCT TATCATGGGC GGTGACGACG GGGACCATTT 12 00 

TACCATCACC ACCCACCCTG AGAGCAACCA GGGCATCCTG ACAACCAGGA AGGGTTTGGA 12 60 

TTTTGAGGCC AAAAACCAGC ACACCCTGTA CGTTGAAGTG ACCAACGAGG CCCCTTTTGT 1320 

GCTGAAGCTC CCAACCTCCA CAGCCACCAT AGTGGTCCAC GTGGAGGATG TCTU^TGAGGC 13 80 

ACCTGTGTTT GTCCCACCCT CCAAAGTCGT TGAGGTCCAG GAGGGCATCC CCACTGGGGA 1440 

GCCTGTGTGT GTCTACACTG CAGAAGACCC TGACAAGGAG AATCAAAAGA TCAGCTACCG 15 00 

CATCCTGAGA GACCCAGCAG GGTGGCTAGC CATGGACCCA GACAGTGGGC AGGTCACAGC 15 60 

TGTGGGCACC CTCGACCGTG AGGATGAGCA GTTTGTGAGG AACAACATCT ATGAAGTCAT 1620 

GGTCTTGGCC ATGGACAATG GAAGCCCTCC CACCACTGGC ACGGGAACCC TTCTGCTAAC 1680 

ACTGATTGAT GTCAATGACC ATGGCCCAGT CCCTGAGCCC CGTCAGATCA CCATCTGCAA 1740 

CCAAAGCCCT GTGCGCCAGG TGCTGAACAT CACGGACAAG GACCTGTCTC CCCACACCTC 18 00 

CCCTTTCCAG GCCCAGCTCA CAGATGACTC AGACATCTAC TGGACGGCAG AGGTCAACGA 1860 

GGAAGGTGAC ACAGTGGTCT TGTCCCTGAA GAAGTTCCTG AAGCAGGATA CATATGACGT 1920 

GCACCTTTCT CTGTCTGACC ATGGCAACAA AGAGCAGCTG ACGGTGATCA GGGCCACTGT 1980 

GTGCGACTGC CATGGCCATG TCGAAACCTG CCCTGGACCC TGGAAGGGAG GTTTCATCCT 2 040 

CCCTGTGCTG GGGGCTGTCC TGGCTCTGCT GTTCCTCCTG CTGGTGCTGC TTTTGTTGGT 2100 

GAGAAAGAAG CGGAAGATCA AGGAGCCCCT CCTACTCCCA GAAGATGACA CCCGTGACAA 2160 

CGTCTTCTAC TATGGCGAAG AGGGGGGTGG CGAAGAGGAC CAGGACTATG ACATCACCCA 2220 

GCTCCACCGA GGTCTGGAGG CCAGGCCGGA GGTGGTTCTC CGCAATGACG TGGCACCAAC 2280 

CATCATCCCG ACACCCATGT ACCGTCCTCG GCCAGCCAAC CCAGATGAAA TCGGCAACTT 2 340 

TATAATTGAG AACCTGAAGG CGGCTAACAC AGACCCCACA GCCCCGCCCT ACGACACCCT 24 00 

CTTGGTGTTC GACTATGAGG GCAGCGGCTC CGACGCCGCG TCCCTGAGCT CCCTCACCTC 24 60 

CTCCGCCTCC GACCAAGACC AAGATTACGA TTATCTGAAC GAGTGGGGCA GCCGCTTCAA 2520 

GAAGCTGGCA GACATGTACG GTGGCGGGGA GGACGACTAG GCGGCCTGCC TGCAGGGCTG 2580 



GGGACCAAAC GTCAGGCCAC AGAGCATCTC CAAGGGGTCT 
GACTTCGGAG CTTGTCAGGA AGTGGCCGTA GCAACTTGGC 
ACGTTAGAGT GGTTGCTTCC TTAGCCTTTC AGGATGGAGG 
AGCACTGAAA ACCTCTCCAC CTGGGCCAGG GTTGCCTCAG 
TCTTACCTGC CGTAAAATGC TCAACCCTGT GTCCTGGGCC 
TACAGTGGAC TTTCTCTCTG GAATGGAACC TTCTTAGGCC 
TTTTTTTAAT GCTATCTTCA AAACGTTAGA GAAAGTTCTT 
GCTGGGCCCA CTGGCCGTCC TGCATTTCTG GTTTCCAGAC 
TGGATCTCTG CGTTTTTATA CTGAGTGTGC CTAGGTTGCC 
GTTGCGTTGC TATAGATGAA GGGTGAGGAC AATCGTGTAT 
TAAAGAAACT TTTCCCAGAA AAAAA 



SEQ ID NO: 14 Protein sequence: 
Protein Accession #: NP_001784.2 

1 11 21 31 

I I I I 

MGLPRGPLAS LLLLQVCWLQ CAASEPCRAV FREAEVTLEA 
QEPALFSTDN DDFTVRNGET VQERRSLKER NPLKIFPSKR 
KGPFPQRLNQ LKSNKDRDTK IFYSITGPGA DSPPEGVFAV 
YELFGHAVSE NGASVEDPMN ISIIVTDQND HKPKFTQDTF 
DEDDAIYTYN GWAYSIHSQ EPKDPHDLMF TIHRSTGTIS 
TDMDGDGSTT TAVAWEILD ANDNAPMFDP QKYEAHVPEN 
AWRATYLIMG GDDGDHFTIT THPESNQGIL TTRKGLDFEA 
PTSTATIWH VEDVNEAPVF VPPSKWEVQ EGIPTGEPVC 
DPAGWLAMDP DSGQVTAVGT LDREDEQFVR NNIYEVMVLA 
VNDHGPVPEP RQITICNQSP VRQVLNITDK DLSPHTSPFQ 
TWLSLKKFL KQDTYDVHLS LSDHGNKEQL TVIRATVCDC 
GAVLALLFLL LVLLLLVRKK RKIKEPLLLP EDDTRDNVFY 
GLEARPEWL RNDVAPTIIP TPMYRPRPAN PDEXGNFIIE 
DYEGSGSDAA SLSSLTSSAS DQDQDYDYLN EWGSRFKKLA 

SEQ ID NO: 15 DNA sequence 

Nucleic Acid Accession #: XM_051860.2 
Coding sequence: 261-4346 

1 11 21 31 

I I I I 

GAGCTAGCGC TCAAGCAGAG CCCAGCGCGG TGCTATCGGA 

CGGCGCGGGG AGCCAGCGGG GCTGAGCGCG GCCAGGGTCT 

AGCTACCACT CCGCTTGCCC ACGCCCCGGG AGCTCGCGGC 

ACGTCCGGGG CCGCTGCGCT CCTGGCCCGC GAGGCGTGAC 

AGAGGGAGCA CACTGCCAGG ATGGGAGCTG CTGGGAGGCA 

TGCTGACCAT CAGCTGGCTC ACTCTGACCT GCTTCCCTGG 

CTGGGTGCCC TGACCAGAGC CCTGAGTTGC AACCCTGGAA 

ACCATGTGCA TATCGGCCAG GGCAAGACAC TGCTGCTCAC 

CCATCCACAT CTCAGAGGGA GGCAAGCTGG TCATTAAAGA 

TGCGAACCCG GCACATCCTG ATTGACAACG GAGGAGAGCT 

GCCCTTTCCA GGGCAATTTC ACCATCATTT TGTATGGAAG 

CGGATCCTTA CTATGGTCTG AAGTACATTG GGGTTGGTAA 

ATGGACAGAA AAAGCTCTCC TGGACATTTC TGAACAAGAC 

CAGAAGGAGG CTATTTTTTT GAAAGGAGCT GGGGCCACCG 

TCGACCCCAA ATCAGGCACA GTCATCCATT CTGACCGGTT 

AAGAGAGTGA ACGTCTGGTC CAGTATTTGA ACGCGGTGCC 

TTGCAGTGAA TGATGAAGGT TCTCGAAATC TGGATGACAT 

AATTGGGAAG CAAACACTTC CTGCACCTTG GATTTAGACA 

TGAAAGGAAA TCCATCATCT TCAGTGGAAG ACCATATTGA 

CTGCTGCTGC CCGGGTATTC AAATTGTTCC AGACAGAGCA 

CTTTGTCCAG TGAGTGGGTT CAAGACGTGG AGTGGACGGA 



CAGTTCCCCC 


TTCAGCTGAG 


2640 


GGAGACAGGC 


TATGAGTCTG 


2700 


AATGTGGGCA 


GTTTGACTTC 


2760 


AGGCCAAGTT 


TCCAGAAGCC 


2820 


TGGGCCTGCT 


GTGACTGACC 


2880 


TCCTGGTGCA 


ACTTAATTTT 


2940 


CAAAAGTGCA 


GCCCAGAGCT 


3000 


CCCAATGCCT 


CCCATTCGGA 


3060 


CCTTATTTTT 


TATTTTCCCT 


3120 


ATGTACTAGA 


ACTTTTTTAT 


3180 


41 

1 


51 
1 




1 

GGAEQEPGQA 


1 

LGKVFMGCPG 


60 


ILRRHKRDWV 


VAPISVPENG 


120 


EKETGWLLLN 


KPLDREEIAK 


180 


RGSVLEGVLP 


GTSVMQVTAT 


240 


VISSGLDREK 


VPEYTLTIQA 


300 


AVGHEVQRLT 


VTDLDAPNSP 


360 


KNQHTLYVEV 


TNEAPFVLKL 


420 


VYTAEDPDKE 


NQKISYRILR 


480 


MDNGSPPTTG 


TGTLLLTLID 


540 


AQLTDDSDIY 


WTAEVNEEGD 


600 


HGHVETCPGP 


WKGGFILPVL 


660 


YGEEGGGEED 


QDYDITQLHR 


720 


NLKAANTDPT 


APPYDTLLVF 


780 


DMYGGGEDD 






4 1 
1 


51 
1 




1 

CAGAGCCTGG 


1 

CGAGCGCAAG 


60 


GAACCCAGAT 


TTCCCAGACT 


120 


GCCTGGCGGT 


CAGCGACCAG 


180 


ACTGTCTCGG 


CT AC AGAC C C 


240 


GGACTTCCTC 


TTCAAGGCCA 


300 


GGCCACATCC 


ACAGTGGCTG 


360 


CCCTGGCCAT 


GACCAAGACC 


420 


CTCTTCTGCC 


ACGGTCTATT 


480 


C C ACG ACG AG 


CCGATTGTTT 

X^^^XJ^»X X \J X X X 


540 


GCATGCTGGG 


AGTGCCCTCT 


600 


RRPTGATC3AA 


GGTATTCAGC 


660 




CTTGAGTTGC 
X X yjnsj x x w v.. 


720 


CCTTCACCCA 




780 


X \jyjt\\j X X n. X x 


OTTPATGTCA 

\J X X X V7 X \^n. 


84 0 


Tnar'a r*r"T* AT 

i. V7nV^n^^ 1 A. 1 


AGATrT'AAriA 




CGATGGCAGG 


ATCCTTTCTG 


960 


GGCCAGGAAG 


GCGATGACCA 


1020 


CCCTTGGAGT 


TTTCTAACTG 


1080 


ATATCATGGA 


CATCGAGGCT 


1140 


TGGCGAATAT 


TTCAATGTTT 


1200 


GTGGTTCGAT 


CATGATAAAG 


1260 



TATCTCAGAC TAAAGGTGGG GAGAAAATTT CAGACCTCTG GAAAGCTCAC CCAGGAAAAA 132 0 

TATGCAATCG TCCCATTGAT ATACAGGCCA CTACAATGGA TGGAGTTAAC CTCAGCACCG 1380 

AGGTTGTCTA CAAAAAAGGC CAGGATTATA GGTTTGCTTG CTACGACCGG GGCAGAGCCT 144 0 

GCCGGAGCTA CCGTGTACGG TTCCTCTGTG GGAAGCCTGT GAGGCCCAAA CTCACAGTCA 1500 

CCATTGACAC C/^TGTGAAC AGCACCATTC TGAACTTGGA GGATAATGTA CAGTCATGGA 1560 

AACCTGGAGA TACCCTGGTC ATTGCCAGTA CTGATTACTC CATGTACCAG GCAGAAGAGT 162 0 

TCCAGGTGCT TCCCTGCAGA TCCTGCGCCC CCAACCAGGT CAAAGTGGCA GGGAAACCAA 1680 

TGTACCTGCA CATCGGGGAG GAGATAGACG GCGTGGACAT GCGGGCGGAG GTTGGGCTTC 174 0 

TGAGCCGGAA CATCATAGTG ATGGGGGAGA TGGAGGACAA ATGCTACCCC TACAGAAACC 1800 

ACATCTGCAA TTTCTTTGAC TTCGATACCT TTGGGGGCCA CATCAAGTTT GCTCTGGGAT 1860 

TTAAGGCAGC ACACTTGGAG GGCACGGAGC TGAAGCATAT GGGACAGCAG CTGGTGGGTC 192 0 

AGTACCCGAT TCACTTCCAC CTGGCCGGTG ATGTAGACGA AAGGGGAGGT TATGACCCAC 1980 

CCACATACAT CAGGGACCTC TCCATCCATC ATACATTCTC TCGCTGCGTC ACAGTCCATG 2 04 0 

GCTCCAATGG CTTGTTGATC AAGGACGTTG TGGGCTATAA CTCTTTGGGC CACTGCTTCT 210 0 

TCACGGAAGA TGGGCCGGAG GAACGCAACA CTTTTGACCA CTGTCTTGGC CTCCTTGTCA 216 0 

AGTCTGGAAC CCTCCTCCCC TCGGACCGTG ACAGCAAGAT GTGCAAGATG ATCACAGAGG 222 0 

ACTCCTACCC GGGGTACATC CCCAAGCCCA GGCAAGACTG CAATGCTGTG TCCACCTTCT 22 8 0 

GGATGGCCAA TCCCAACAAC AACCTCATCA ACTGTGCCGC TGCAGGATCT GAGGAAACTG 2 34 0 

GATTTTGGTT TATTTTTCAC CACGTACCAA CGGGCCCCTC CGTGGGAATG TACTCCCCAG 240 0 

GTTATTCAGA GCACATTCCA CTGGGAAAAT TCTATAACAA CCGAGCACAT TCCAACTACC 2 4 60 

GGGCTGGCAT GATCATAGAC AACGGAGTCA AAACCACCGA GGCCTCTGCC AAGGACAAGC 2 520 

GGCCGTTCCT CTCAATCATC TCTGCCAGAT ACAGCCCTCA CCAGGACGCC GACCCGCTGA 2 580 

AGCCCCGGGA GCCGGCCATC ATCAGACACT TCATTGCCTA CAAGAACCAG GACCACGGGG 2 64 0 

CCTGGCTGCG CGGCGGGGAT GTGTGGCTGG ACAGCTGCCG GTTTGCTGAC AATGGCATTG 27 0 0 

GCCTGACCCT GGCCAGTGGT GGAACCTTCC CGTATGACGA CGGCTCCAAG CAAGAGATAA 2 76 0 

AGAACAGCTT GTTTGTTGGC GAGAGTGGCA ACGTGGGGAC GGAAATGATG GACAATAGGA 2 82 0 

TCTGGGGCCC TGGCGGCTTG GACCATAGCG GAAGGACCCT CCCTATAGGC CAGAATTTTC 2 880 

CAATTAGAGG AATTCAGTTA TATGATGGCC CCATCAACAT CCAAAACTGC ACTTTCCGAA 2 94 0 

AGTTTGTGGC CCTGGAGGGC CGGCACACCA GCGCCCTGGC CTTCCGCCTG AATAATGCCT 3 00 0 

GGCAGAGCTG CCCCCATAAC AACGTGACCG GCATTGCCTT TGAGGACGTT CCGATTACTT 3 06 0 

CCAGAGTGTT CTTCGGAGAG CCTGGGCCCT GGTTCAACCA GCTGGACATG GATGGGGATA 312 0 

AGACATCTGT GTTCCATGAC GTCGACGGCT CCGTGTCCGA GTACCCTGGC TCCTACCTCA 3180 

CGAAGAATGA CAACTGGCTG GTCCGGCACC CAGACTGCAT CAATGTTCCC GACTGGAGAG 3 24 0 

GGGCCATTTG CAGTGGGTGC TATGCACAGA TGTACATTCA AGCCTACAAG ACCAGTAACC 3 3 00 

TGCGAATGAA GATCATCAAG AATGACTTCC CCAGCCACCC TCTTTACCTG GAGGGGGCGC 3 360 

TCACCAGGAG CACCCATTAC CAGCAATACC AACCGGTTGT CACCCTGCAG AAGGGCTACA 342 0 

CCATCCACTG GGACCAGACG GCCCCCGCCG AACTCGCCAT CTGGCTCATC AACTTCAACA 34 8 0 

AGGGCGACTG GATCCGAGTG GGGCTCTGCT ACCCGCGAGG CACCACATTC TCCATCCTCT 3 54 0 

CGGATGTTCA CAATCGCCTG CTGAAGCAAA CGTCCAAGAC GGGCGTCTTC GTGAGGACCT 3 600 

TGCAGATGGA CAAAGTGGAG CAGAGCTACC CTGGCAGGAG CCACTACTAC TGGGACGAGG 3660 

ACTCAGGGCT GTTGTTCCTG AAGCTGAAAG CTCAGAACGA GAGAGAGAAG TTTGCTTTCT 3 72 0 

GCTCCATGAA AGGCTGTGAG AGGATAAAGA TTAAAGCTCT GATTCCAAAG AACGCAGGCG 3 78 0 

TCAGTGACTG CACAGCCACA GCTTACCCCA AGTTCACCGA GAGGGCTGTC GTAGACGTGC 3 84 0 

CGATGCCCAA GAAGCTCTTT GGTTCTCAGC TGAAAACAAA GGACCATTTC TTGGAGGTGA 3 90 0 

AGATGGAGAG TTCCAAGCAG CACTTCTTCC ACCTCTGGAA CGACTTCGCT TACATTGAAG 3 960 

TGGATGGGAA GAAGTACCCC AGTTCGGAGG ATGGCATCCA GGTGGTGGTG ATTGACGGGA 4 02 0 

ACCAAGGGCG CGTGGTGAGC CACACGAGCT TCAGGAACTC CATTCTGCAA GGCATACCAT 4 080 

GGCAGCTTTT CAACTATGTG GCGACCATCC CTGACAATTC CATAGTGCTT ATGGCATCAA 414 0 

AGGGAAGATA CGTCTCCAGA GGCCCATGGA CCAGAGTGCT GGAAAAGCTT GGGGCAGACA 42 00 

GGGGTCTCAA GTTGAAAGAG CAAATGGCAT TCGTTGGCTT CAAAGGCAGC TTCCGGCCCA 42 60 

TCTGGGTGAC ACTGGACACT GAGGATCACA AAGCCAAAAT CTTCCAAGTT GTGCCCATCC 4320 

CTGTGGTGAA GAAGAAGAAG TTGTGAGGAC AGCTGCCGCC CGGTGCCACC TCGTGGTAGA 43 80 

CTATGACGGT GACTCTTGGC AGCAGACCAG TGGGGGATGG CTGGGTCCCC CAGCCCCTGC 4 44 0 

CAGCAGCTGC CTGGGAAGGC CGTGTTTCAG CCCTGATGGG CCAAGGGAAG GCTATCAGAG 4500 

ACCCTGGTGC TGCCACCTGC CCCTACTCAA GTGTCTACCT GGAGCCCCTG GGGCGGTGCT 4560 

GGCCAATGCT GGAAACATTC ACTTTCCTGC AGCCTCTTGG GTGCTTCTCT CCTATCTGTG 4 62 0 

CCTCTTCAGT GGGGGTTTGG GGACCATATC AGGAGACCTG GGTTGTGCTG ACAGCAAAGA 4 680 

TCCACTTTGG CAGGAGCCCT GACCCAGCTA GGAGGTAGTC TGGAGGGCTG GTCATTCACA 4 74 0 

GATCCCCATG GTCTTCAGCA GACAAGTGAG GGTGGTAAAT GTAGGAGAAA GAGCCTTGGC 4 80 0 

CTTAAGGAAA TCTTTACTCC TGTAAGCAAG AGCCAACCTC ACAGGATTAG GAGCTGGGGT 4 860 

AGAACTGGCT ATCCTTGGGG AAGAGGCAAG CCCTGCCTCT GGCCGTGTCC ACCTTTCAGG 4 920 



AGACTTTGAG TGGCAGGTTT GGACTTGGAC TAGATGACTC TCAAAGGCCC TTTTAGTTCT 4 980 

GAGATTCCAG AAATCTGCTG CATTTCACAT GGTACCTGGA ACCCAACAGT TCATGGATAT 504 0 

CCACTGATAT CCATGATGCT GGGTGCCCCA GCGCACACGG GATGGAGAGG TGAGAACTAA 5100 

TGCCTAGCTT GAGGGGTCTG CAGTCCAGTA GGGCAGGCAG TCAGGTCCAT GTGCACTGCA 5160 

ATGCCAGGTG GAGAAATCAC AGAGAGGTAA AATGGAGGCC AGTGCCATTT CAGAGGGGAG 522 0 

GCTCAGGAAG GCTTCTTGCT TACAGGAATG AAGGCTGGGG GCATTTTGCT GGGGGGAGAT 52 80 

GAGGCAGCCT CTGGAATGGC TCAGGGATTC AGCCCTCCCT GCCGCTGCCT GCTGAAGCTG 5340 

GTGACTACGG GGTCGCCCTT TGCTCACGTC TCTCTGGCCC ACTCATGATG GAGAAGTGTG 54 00 

GTCAGAGGGG AGCAATGGGC TTTGCTGCTT ATGAGCACAG AGGAATTCAG TCCCCAGGCA 54 6 0 

GCCCTGCCTC TGACTCCAAG AGGGTGAAGT CCACAGAAGT GAGCTCCTGC CTTAGGGCCT 5 52 0 

CATTTGCTCT TCATCCAGGG AACTGAGCAC AGGGGGCCTC CAGGAGACCC TAGATGTGCT 558 0 

CGTACTCCCT CGGCCTGGGA TTTCAGAGCT GGAAATATAG AAAATATCTA GCCCAAAGCC 564 0 

TTCATTTTAA CAGATGGGGA AAGTGAGCCC CCAAGATGGG AAAGAACCAC ACAGCTAAGG 57 00 

GAGGGCCTGG GGAGCCCCAC CCTAGCCCTT GCTGCCACAC CACATTGCCT CAACAACCGG 57 60 

CCCCAGAGTG CCCAGGCACT CCTGAGGTAG CTTCTGGAAA TGGGGACAAG TCCCCTCGAA 5 82 0 

GGAAAGGAAA TGACTAGAGT AGAATGACAG CTAGCAGATC TCTTCCCTCC TGCTCCCAGC 5880 

GCACACAAAC CCGCCCTCCC CTTGGTGTTG GCGGTCCCTG TGGCCTTCAC TTTGTTCACT 594 0 

ACCTGTCAGC CCAGCCTGGG TGCACAGTAG CTGCAACTCC CCATTGGTGC TACCTGGCTC 6000 

TCCTGTCTCT GCAGCTCTAC AGGTGAGGCC CAGCAGAGGG AGTAGGGCTC GCCATGTTTC 6060 

TGGTGAGCCA ATTTGGCTGA TCTTGGGTGT CTGAACAGCT ATTGGGTCCA CCCCAGTCCC 612 0 

TTTCAGCTGC TGCTTAATGC CCTGCTCTCT CCCTGGCCCA CCTTATAGAG AGCCCAAAGA 6180 

GCTCCTGTAA GAGGGAGAAC TCTATCTGTG GTTTATAATC TTGCACGAGG CACCAGAGTC 6240 

TCCCTGGGTC TTGTGATGAA CTACATTTAT CCCCTTTCCT GCCCCAACCA CAAACTCTTT 63 00 

CCTTCAAAGA GGGCCTGCCT GGCTCCCTCC ACCCAACTGC ACCCATGAGA CTCGGTCCAA 63 60 

GAGTCCATTC CCCAGGTGGG AGCCAACTGT CAGGGAGGTC TTTCCCACCA AACATCTTTC 64 2 0 

AGCTGCTGGG AGGTGACCAT AGGGCTCTGC TTTTAAAGAT ATGGCTGCTT CAAAGGCCAG 64 80 

AGTCACAGGA AGGACTTCTT CCAGGGAGAT TAGTGGTGAT GGAGAGGAGA GTTAAAATGA 654 0 

CCTCATGTCC TTCTTGTCCA CGGTTTTGTT GAGTTTTCAC TCTTCTAATG CAAGGGTCTC 6600 

ACACTGTGAA CCACTTAGGA TGTGATCACT TTCAGGTGGC CAGGAATGTT GAATGTCTTT 6660 

GGCTCAGTTC ATTTAAAAAA GATATCTATT TGAAAGTTCT CAGAGTTGTA CATATGTTTC 6720 

ACAGTACAGG ATCTGTACAT AAAAGTTTCT TTCCTAAACC ATTCACCAAG AGCCAATATC 6780 

TAGGCATTTT CTTGGTAGCA CAAATTTTCT TATTGCTTAG AAAATTGTCC TCCTTGTTAT 684 0 

TTCTGTTTGT AAGACTTAAG TGAGTTAGGT CTTTAAGGAA AGCAACGCTC CTCTGAAATG 6900 

CTTGTCTTTT TTCTGTTGCC GAAATAGCTG GTCCTTTTTC GGGAGTTAGA TGTATAGAGT 6960 

GTTTGTATGT AAACATTTCT TGTAGGCATC ACCATGAACA AAGATATATT TTCTATTTAT 702 0 
TTATTATATG TGCACTTCAA GAAGTCACTG TCAGAGAAAT AAAGAATTGT CTTAAATGTC 



SEQ ID NO: 16 Protein sequence: 
Protein Accession #: XP 051860.2 



MGAAGRQDFL 
GKTLLLTSSA 
TIILYGRADE 
ERSWGHRGVI 
SRNLDDMARK 
KLFQTEHGEY 
IQATTMDGVN 
STILNLEDNV 
EIDGVDMRAE 
GTELKHMGQQ 
KDWGYNSLG 
PKPRQDCNAV 
LGKFYNNRAH 
IRHFIAYKNQ 
ESGNVGTEMM 
RHTSALAFRL 
VDGSVSEYPG 
NDFPSHPLYL 



11 

I 

FKAMLTISWL 
TVYSIHISEG 
GIQPDPYYGL 
VHVIDPKSGT 
AMTKLGSKHF 
FNVSLSSEWV 
LSTEWYKKG 
QSWKPGDTLV 
VGLLSRNIIV 
LVGQYPIHFH 
HCFFTEDGPE 
STFWMANPNN 
SNYRAGMIID 
DHGAWLRGGD 
DNRIWGPGGL 
NNAWQSCPHN 
SYLTKNDNWL 
EGALTRSTHY 



21 

I 

TLTCFPGATS 
GKLVIKDHDE 
KYIGVGKGGA 
VIHSDRFDTY 
LHLGFRHPWS 
QDVEWTEWFD 
QDYRFACYDR 
lASTDYSMYQ 
MGEMEDKCYP 
LAGDVDERGG 
ERNTFDHCLG 
NLINCAAAGS 
NGVKTTEASA 
VWLDSCRFAD 
DHSGRTLPIG 
NVTGIAFEDV 
VRHPDCINVP 
QQYQPWTLQ 



31 

I 

TVAAGCPDQS 
PIVLRTRHIL 
LELHGQKKLS 
RSKKESERLV 
FLTVKGNPSS 
HDKVSQTKGG 
GRACRSYRVR 
AEEFQVLPCR 
YRNHICNFFD 
YDPPTYIRDL 
LLVKSGTLLP 
EETGFWFIFH 
KDKRPFLSII 
NGIGLTLASG 
QNFPIRGIQL 
PITSRVFFGE 
DWRGAICSGC 
KGYTIHWDQT 



41 

I 

PELQPWNPGH 
IDNGGELHAG 
WTFLNKTLHP 
QYLNAVPDGR 
SVEDHIEYHG 
EKISDLWKAH 
FLCGKPVRPK 
SCAPNQVKVA 
FDTFGGHIKF 
SIHHTFSRCV 
SDRDSKMCKM 
HVPTGPSVGM 
SARYSPHQDA 
GTFPYDDGSK 
YDGPINIQNC 
PGPWFNQLDM 
YAQMYIQAYK 
APAELAIWLI 



51 

! 

DQDHHVHIGQ 
SALCPFQGNF 
GGMAEGGYFF 
ILSVAVNDEG 
HRGSAAARVF 
PGKICNRPID 
LTVTIDTNVN 
GKPMYLHIGE 
ALGFKAAHLE 
TVHGSNGLLI 
ITEDSYPGYI 
YSPGYSEHIP 
DPLKPREPAI 
QEIKNSLFVG 
TFRKFVALEG 
DGDKTSVFHD 
TSNLRMKIIK 
NFNKGDWIRV 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 



GLCYPRGTTF SILSDVHNRL LKQTSKTGVF VRTLQMDKVE QSYPGRSHYY WDEDSGLLFL 114 0 

KLKAQNEREK FAFCSMKGCE RIKIKALIPK NAGVSDCTAT AYPKFTERAV VDVPMPKKDF 12 00 

GSQLKTKDHF LEVKMESSKQ HFFHLWNDFA YIEVDGKKYP SSEDGIQVW IDGNQGRWS 12 60 

HTSFRNSILQ GIPWQLFNYV ATIPDNSIVL MASKGRYVSR GPWTRVLEKL GADRGLKLKE 1320 

QMAFVGFKGS FRPIWVTLDT EDHKAKIFQV VPIPWKKKK L 



SEQ ID NO: 17 DNA sequence 

Nucleic Acid Accession #: NM__015515.1 
Coding sequence: 61-132 9 

1 11 21 31 41 51 

I I I I I I 

AGTTCTGCGG TGCCAGGGAG TGGAGCAGAG CTCAGCCCCG TCCCAAACAC AGATGGGACC 6 0 

ATGAACTCCG GACACAGCTT CAGCCAGACC CCCTCGGCCT CCTTCCATGG CGCCGGAGGT 12 0 

GGCTGGGGCC GGCCCAGGAG CTTCCCCAGG GCTCCCACCG TCCATGGCGG TGCGGGGGGA 180 

GCCCGCATCT CCCTGTCCTT CACCACGCGG AGCTGCCCAC CCCCTGGAGG GTCTTGGGGT 240 

TCTGGAAGAA GCAGCCCCCT ACTAGGCGGA AATGGGAAGG CCACCATGCA GAATCTCAAC 3 00 

GACCGCCTGG CCTCCTACCT GGAGAAGGTT CGCGCCCTGG AGGAGGCCAA CATGAAGCTG 3 60 

GAAAGCCGCA TCCTGAAATG GCACCAGCAG AGAGATCCTG GCAGTAAGAA AGATTATTCC 4 20 

CAGTATGAGG AAAACATCAC ACACCTGCAG GAGCAGATAG TGGATGGTAA GATGACCAAT 4 80 

GCTCAGATTA TTCTTCTCAT TGACAATGCC AGGATGGCAG TGGATGACTT CAACCTCAAG 540 

TATGAAAATG AACACTCCTT TAAGAAAGAC TTGGAAATTG AAGTCGAGGG CCTCCGAAGG 600 

ACCTTAGACA ACCTGACCAT TGTCACAACA GACCTAGAAC AGGAGGTGGA AGGAATGAGG 660 

AAAGAGCTCA TTCTCATGAA GGAGCACCAT GAGCAGGAAA TGGAGGAGCA TCATGTGCCA 72 0 

AGTGACTTCA ATGTCAATGT GAAGGTGGAT ACAGGTCCCA GGGAAGATCT GATTT^GGTC 7 80 

CTGGAGGATA TGAGACAAGA ATATGAGCTT ATAATAAAGA AGAAGCATCG AGACTTGGAC 840 

ACTTGGTATA AAGAACAGTC TGCAGCCATG TCCCAGGAGG CAGCCAGTCC AGCCACTGTG 90 0 

CAGAGCAGAC AAGGTGACAT CCACGAACTG AAGCGCACAT TCCAGGCCCT GGAGATTGAC 960 

CTGCAGGCAC AGTACAGCAC GAAATCTGCT TTGGAAAACA TGTTATCCGA GACCCAGTCT 102 0 

CGGTACTCCT GCAAGCTCCA GGACATGCAA GAGATCATCT CCCACTATGA GGAGGAACTG 10 80 

ACGCAGCTAC GCCACGAACT GGAGCGGCAG AACAATGAAT ACCAAGTGCT GCTGGGCATC 114 0 

AAAACCCACC TGGAGAAGGA AATCACCACG TACCGACGGC TCCTGGAGGG AGAGAGTGAA 12 00 

GGGACACGGG AAGAATCAAA GTCGAGCATG AAAGTGTCTG CAACTCCAAA GATCAAGGCC 12 60 

ATAACCCAGG AGACCATCAA CGGAAGATTA GTTCTTTGTC AAGTGAATGA AATCCAAAAG 1320 

CACGCATGAG ACCAATGAAA GTTTCCGCCT GTTGTAAAGT CTATTTTCCC CCAAGGAAAG 13 80 

TCCTTGCACA GACACCAGTG AGTGAGTTCT AAAAGATACC CTTGGAATTA TCAGACTCAG 144 0 

AAACTTTTAT rprprpr,n,prj.r[,rj.rp,p CTGTAACAGT CTCACCAGAC TTCTCATAAT GCTCTTAATA 15 00 

TATTGCACTT TTCTAATCAA AGTGCGAGTT TATGAGGGTA AAGCTCTACT TTCCTACTGC 1560 
AGCCTTCAGA TTCTCATCAT TTTGCATCTA TTTTGTAGCC AATAAAACTC CGCACTAGC 



SEQ ID NO: 18 Protein sequence: 
Protein Accession #: NP_056330.1 

1 11 21 31 41 51 

I I I I I I 

MNSGHSFSQT PSASFHGAGG GWGRPRSFPR APTVHGGAGG ARISLSFTTR SCPPPGGSWG 60 

SGRSSPLIiGG NGKATMQNLN DRLASYLEKV RALEEANMKL ESRILKWHQQ RDPGSKKDYS 12 0 

QYEENITHLQ EQIVDGKMTN AQIILLIDNA RMAVDDFNLK YENEHSFKKD LEIEVEGLRR 180 

TLDNLTIVTT DLEQEVEGMR KELILMKEHH EQEMEEHHVP SDFNVNVKVD TGPREDLIKV 240 

LEDMRQEYEL IIKKKHRDLD TWYKEQSAAM SQEAASPATV QSRQGDIHEL KRTFQALEID 300 

LQAQYSTKSA LENMLSETQS RYSCKLQDMQ EIISHYEEEL TQLRHELERQ NNEYQVLLGI 3 60 

KTHLEKEITT YRRLLEGESE GTREESKSSM KVSATPKIKA ITQETINGRL VLCQVNEIQK 420 
HA 



SEQ ID NO: 19 DNA sequence 

Nucleic Acid Accession #: see Table 25 & 25A for complete list 



1 11 21 31 41 51 

I I I I I I 

TTTTTTTTTT TTAAAAAAAA GAGGCTTGGT AAGTTTTTGA TACTTAGTTG ACTTTTAGCA 60 

TTATCCAGCA TTTGTATTAT GAACCAGTGA GTACTGTAAT TTTTCTTTCC CTTTCAGAAA 120 

GACTCAAAGG GAACATATAA ATGTTTCCTA TTTTTNNNNN NNNNNNNNNN NNNNNNNNNN 180 

NNNNACCCAT CGTGCGATGA TCNNNNNNNN NNNNNNNNNN NNNNNTTGGG ATCCAGTTTC 24 0 

AAATAAGGTA TGGGAAAAAC AGATGTTTTC ATTATCGCCA CTTAATCCTT ACTTCCGATT 3 00 

ATAATTATAC ATGTTTGGCT GTAATAACTA TACTAAAGCA TGCTTGTGAA AGTAGACTTC 360 

TACAAGGACA GAAAACCCAC AACAACAAAG ATCGATCACG AAAGACAAGG CATATTCATT 42 0 

CATTAATTTA CTTCTCTTAG ACCCGGGACA TGTGGGACAA ATACTTTTGT CCTCATGGAT 4 80 

GGCTTGATAA TTTATTTATA TGTTCTAGAG TCTGAGGATT TTCTTTCAGT GGCAGACAAC 54 0 

AAAGGATGTT ACAATTTACT TCAAAATAAT ACAATCATGG TTTAATTTAC AGTGTAAATC 600 

CATAACTATT TTATAGAGAT GGATTATCAT ACATGGGATT ATAAAAATAA CTTACCCATA 660 

TGCTTGCAAA ATAGACTTTT CCTATTGGGA GGAACATCTT TTAACCTAAA ACGGATTTAT 72 0 

TTCAGATGAA TTAGACAGTA CATTTTTCAG GAGAACCAGC CTTACTGGAT GATCTTTTGT 7 80 

CAGGTTTGGA GGCCTCTTCT TTGTCTTTGC AACCATAACC CCTTTTCAGC TGAAGACCAC 84 0 
TGGCCTTCAA CCCAAGCCAG GAGTTTGGCT CAAATGA 



SEQ ID NO: 20 DNA sequence 

Nucleic Acid Accession #: D32051.1 

Coding sequence: 72-1373 

1 11 21 31 41 51 

I I I I I I 

GAATTCGAAC CAGGTGGCCA CCCGGTGTCG GTTTCATTTT CCTTTGGAAT TTCTGCTTTA 60 

CAGACAGAAC AATGGCAGCC CGAGTACTTA TAATTGGCAG TGGAGGAAGG GAACATACGC 12 0 

TGGCCTGGAA ACTTGCACAG TCTCATCATG TCAAACAAGT GTTGGTTGCC CCAGGAAACG 180 

CAGGCACTGC CTGCTCTGAA AAGATTTCAA ATACCGCCAT CTCAATCAGT GACCACACTG 240 

CCCTTGCTCA ATTCTGCAAA GAGAAGAAAA TTGAATTTGT AGTTGTTGGA CCAGAAGCAC 3 00 

CTCTGGCTGC TGGGATTGTT GGGAACCTGA GGTCTGCAGG AGTGCAATGC TTTGGCCCAA 3 60 

CAGCAGAAGC GGCTCAGTTA GAGTCCAGCA AAAGGTTTGC CAAAGAGTTT ATGGACAGAC 42 0 

ATGGAATCCC AACCGCACAA TGGAAGGCTT TCACCAAACC TGAAGAAGCC TGCAGCTTCA 4 80 

TTTTGAGTGC AGACTTCCCT GCTTTGGTTG TGAAGGCCAG TGGTCTTGCA GCTGGAAAAG 54 0 

GGGTGATTGT TGCAAAGAGC AAAGAAGAGG CCTGCAAAGC TGTACAAGAG ATCATGCAGG 600 

AGAAAGCCTT TGGGGCAGCT GGAGAAACAA TTGTCATTGA AGAACTTCTT GACGGAGAAG 660 

AGGTGTCGTG TCTGTGTTTC ACTGATGGCA AGACTGTGGC CCCCATGCCC CCAGCACAGG 72 0 

ACCATAAGCG ATTACTGGAG GGAGATGGTG GCCCTAACAC AGGGGGAATG GGAGCCTATT 7 80 

GTCCAGCCCC TCAGGTTTCT AATGATCTAT TACTAAAAAT TAAAGATACT GTTCTTCAGA 84 0 

GGACAGTGGA TGGCATGCAG CAAGAGGGTA CTCCATATAC AGGTATTCTC TATGCTGGAA 900 

TAATGCTGAC CAAGAATGGC CCAAAAGTTC TAGAGTTTAA TTGCCGTTTT GGTGATCCAG 960 

AGTGCCAAGT AATCCTCCCA CTTCTTAAAA GTGATCTTTA TGAAGTGATT CAGTCCACCT 102 0 

TAGATGGACT GCTCTGCACA TCTCTGCCTG TTTGGCTAGA AAACCACACC GCCCTAACTG 10 8 0 

TTGTCATGGC AAGTAAAGGT TATCCTGGAG ACTACACCAA GGGTGTAGAG ATAACAGGGT 114 0 

TTCCTGAGGC TCAAGCTCTA GGACTGGAGG TGTCCCATGC AGGCACTGCC CTCAAAAATG 12 00 

GCAAAGTAGT AACTCATGGG GGTAGAGTTC TTGCAGTCAC AGCCATCCGG GAAAATCTCA 12 60 

TATCAGCCCT TGAGGAAGCC AAGAAAGGAC TAGCTGCTAT AAAGTTTGAG GGAGCAATTT 132 0 

ATAGGAAAGA CATCGGCTTT CGTGCCATAG CTTTCCTCCA GCAGCCCAGG TAAAACTCTA 13 80 

AGCAAGTTAG CTGTAGTGCC ATTTCAGAAA CTGGCCTAAA TGGCTATGTA GAACATTCCA 1440 

TTAACCCTAT AAGTCATTCA GTATTCTTTT CTCTCTGTGG GAGTGATACA GTCTTGGTTT 15 00 

GTATTTTGTT TGAATCAAAA CTGGTTATAG CAATACTCAA ATGGAAAAAA CTTCATGATA 15 6 0 

GCGTAAGTTT GGAAAGTTTA GCAAAATCAC AGTGGTACTG ATTTTTATTT GTTTTCTATT 162 0 

^rj,rp,prp^;^rprp^ TATATTTTTA ATTTTTTTAA CAGGGTCTTC CTCTCTCGCC CAAGTTCTCA 1680 

TGCCTCAGCC TCCCAAATAG CTGGGACTAC AGGCACAGGC CACCACACCT GGCTAATTTT 174 0 

TTTGTATTTT TTGTGGAGAT GGGGTTCACC ATGTTGCCAA GGCCAGTCTG AAAGCCTGGG 1800 

CTCAAGTGAT CCTCCTGCTT TGGCCTCCCA AAATGCTGGG ACTATAGGCA TGAGGCGCTG 1860 

CACTTGGCCT GATACTGATT TTTATTCCTT GCGTTATCAC ATAGTGTTGT ATTTGAAACA 192 0 

TAGTTCATGG TTTTATCAAA GAACTGAAGA TGAGAATACT GGTCATCTAA CTTTGTAATT 1980 



TGATTTGATT ATACTGTAAA GTTTGACAGT CCCATTTTAA CCTGCGTTTG TATCTATTAC 2 040 
TAAAATGTAT TTTTTGACCT CTTACTGATT CATGGTTGGT ATGTACAAAC TGTTGACTTG 2100 
TAAAATCAAT AAAGTCTTAG TTGG 



SEQ ID NO: 21 Protein sequence: 
Protein Accession #: BAA06809.1 

1 11 21 31 41 51 

I I I I I I 

MAARVLIIGS GGREHTLAWK LAQSHHVKQV LVAPGNAGTA CSEKISNTAI SISDHTALAQ 60 

FCKEKKIEFV WGPEAPLAA GIVGNLRSAG VQCFGPTAEA AQLESSKRFA KEFMDRHGIP 120 

TAQWKAFTKP EEACSFILSA DFPALWKAS GLAAGKGVIV AKSKEEACKA VQEIMQEKAF 180 

GAAGETIVIE ELLDGEEVSC LCFTDGKTVA PMPPAQDHKR LLEGDGGPNT GGMGAYCPAP 240 

QVSNDLLLKI KDTVLQRTVD GMQQEGTPYT GILYAGIMLT KNGPKVLEFN CRFGDPECQV 3 00 

ILPLLKSDLY EVIQSTLDGL LCTSLPVWLE NHTALTWMA SKGYPGDYTK GVEITGFPEA 3 60 

QALGLEVSHA GTALKNGKW THGGRVLAVT AIRENLISAL EEAKKGLAAI KFEGAIYRKD 420 
IGFRAIAFLQ QPR 



SEQ ID NO: 22 DNA sequence 

Nucleic Acid Accession #: EOS cloned 

Coding sequence: 1-2424 

1 11 21 31 41 51 

I I I I I I 

ATGCCCCCTT TCCTGTTGCT GGAGGCCGTC TGTGTTTTCC TGTTTTCCAG AGTGCCCCCA 60 

TCTCTCCCTC TCCAGGAAGT CCATGTAAGC AAAGAAACCA TCGGGAAGAT TTCAGCTGCC 12 0 

AGCAAAATGA TGTGGTGCTC GGCTGCAGTG GACATCATGT TTCTGTTAGA TGGGTCTAAC 180 

AGCGTCGGGA AAGGGAGCTT TGAAAGGTCC AAGCACTTTG CCATCACAGT CTGTGACGGT 24 0 

CTGGACATCA GCCCCGAGAG GGTCAGAGTG GGAGCATTCC AGTTCAGTTC CACTCCTCAT 3 00 

CTGGAATTCC CCTTGGATTC ATTTTCAACC CAACAGGAAG TGAAGGCTUVG AATCAAGAGG 3 60 

ATGGTTTTCA AAGGAGGGCG CACGGAGACG GAACTTGCTC TGAAATACCT TCTGCACAGA 42 0 

GGGTTGCCTG GAGGCAGAAA TGCTTCTGTG CCCCAGATCC TCATCATCGT CACTGATGGG 4 80 

AAGTCCCAGG GGGATGTGGC ACTGCCATCC AAGCAGCTGA AGGAAAGGGG TGTCACTGTG 54 0 

TTTGCTGTGG GGGTCAGGTT TCCCAGGTGG GAGGAGCTGC ATGCACTGGC CAGCGAGCCT 60 0 

AGAGGGCAGC ACGTGCTGTT GGCTGAGCAG GTGGAGGATG CCACCAACGG CCTCTTCAGC 660 

ACCCTCAGCA GCTCGGCCAT CTGCTCCAGC GCCACGCCAG ACTGCAGGGT CGAGGCTCAC 72 0 

CCCTGTGAGC ACAGGACGCT GGAGATGGTC CGGGAGTTCG CTGGCAATGC CCCATGCTGG 7 80 

AGAGGATCGC GGCGGACCCT TGCGGTGCTG GCTGCACACT GTCCCTTCTA CAGCTGGAAG 84 0 

AGAGTGTTCC TAACCCACCC TGCCACCTGC TACAGGACCA CCTGCCCAGG CCCCTGTGAC 900 

TCGCAGCCCT GCCAGAATGG AGGCACATGT GTTCCAGAAG GACTGGACGG CTACCAGTGC 960 

CTCTGCCCGC TGGCCTTTGG AGGGGAGGCT AACTGTGCCC TGAAGCTGAG CCTGGAATGC 102 0 

AGGGTCGACC TCCTCTTCCT GCTGGACAGC TCTGCGGGCA CCACTCTGGA CGGCTTCCTG 10 80 

CGGGCCAAAG TCTTCGTGAA GCGGTTTGTG CGGGCCGTGC TGAGCGAGGA CTCTCGGGCC 114 0 

CGAGTGGGTG TGGCCACATA CAGCAGGGAG CTGCTGGTGG CGGTGCCTGT GGGGGAGTAC 12 00 

CAGGATGTGC CTGACCTGGT CTGGAGCCTC GATGGCATTC CCTTCCGTGG TGGCCCCACC 1260 

CTGACGGGCA GTGCCTTGCG GCAGGCGGCA GAGCGTGGCT TCGGGAGCGC CACCAGGACA 132 0 

GGCCAGGACC GGCCACGTAG AGTGGTGGTT TTGCTCACTG AGTCACACTC CGAGGATGAG 13 8 0 

GTTGCGGGCC CAGCGCGTCA CGCAAGGGCG CGAGAGCTGC TCCTGCTGGG TGTAGGCAGT 144 0 

GAGGCCGTGC GGGCAGAGCT GGAGGAGATC ACAGGCAGCC CAAAGCATGT GATGGTCTAC 150 0 

TCGGATCCTC AGGATCTGTT CAACCAAATC CCTGAGCTGC AGGGGAAGCT GTGCAGCCGG 1560 

CAGCGGCCAG GGTGCCGGAC ACAAGCCCTG GACCTCGTCT TCATGTTGGA CACCTCTGCC 162 0 

TCAGTAGGGC CCGAGAATTT TGCTCAGATG CAGAGCTTTG TGAGAAGCTG TGCCCTCCAG 168 0 

TTTGAGGTGA ACCCTGACGT GACACAGGTC GGCCTGGTGG TGTATGGCAG CCAGGTGCAG 174 0 

ACTGCCTTCG GGCTGGACAC CAAACCCACC CGGGCTGCGA TGCTGCGGGC CATTAGCCAG 180 0 

GCCCCCTACC TAGGTGGGGT GGGCTCAGCC GGCACCGCCC TGCTGCACAT CTATGACAAA 1860 

GTGATGACCG TCCAGAGGGG TGCCCGGCCT GGTGTCCCCA AAGCTGTGGT GGTGCTCACA 192 0 

GGCGGGAGAG GCGCAGAGGA TGCAGCCGTT CCTGCCCAGA AGCTGAGGAA CAATGGCATC 198 0 

TCTGTCTTGG TCGTGGGCGT GGGGCCTGTC CTAAGTGAGG GTCTGCGGAG GCTTGCAGGT 2 04 0 

CCCCGGGATT CCCTGATCCA CGTGGCAGCT TACGCCGACC TGCGGTACCA CCAGGACGTG 2100 



CTCATTGAGT GGCTGTGTGG AGAAGCCAAG CAGCCAGTCA ACCTCTGCAA ACCCAGCCCG 2160 

TGCATGAATG AGGGCAGCTG CGTCCTGCAG AATGGGAGCT ACCGCTGCAA GTGTCGGGAT 2220 

GGCTGGGAGG GCCCCCACTG CGAGAACCGT GAGTGGAGCT CTTGCTCTGT ATGTGTGAGC 22 80 

CAGGGATGGA TTCTTGAGAC GCCCCTGAGG CACATGGCTC CCGTGCAGGA GGGCAGCAGC 234 0 

CGTACCCCTC CCAGCAACTA CAGAGAAGGC CTGGGCACTG AAATGGTGCC TACCTTCTGG 24 00 
AATGTCTGTG CCCCAGGTCC TTAG 



SEQ ID NO: 23 Protein secjuence : 
Protein Accession #: EOS cloned 

1 11 21 31 41 51 

I I I I ' ' 

MPPFLLLEAV CVFIiFSRVPP SLPLQEVHVS KETIGKISAA SKMMWCSAAV DIMFLLDGSN 60 

SVGKGSFERS KHFAITVCDG LDISPERVRV GAFQFSSTPH LEFPLDSFST QQEVKARIKR 12 0 

MVFKGGRTET ELALKYLLHR GLPGGRNASV PQILIIVTDG KSQGDVALPS KQLKERGVTV 180 

FAVGVRFPRW EELHALASEP RGQHVLLAEQ VEDATNGLFS TLSSSAICSS ATPDCRVEAH 24 0 

PCEHRTLEMV REFAGNAPCW RGSRRTLAVL AAHCPFYSWK RVFLTHPATC YRTTCPGPCD 3 00 

SQPCQNGGTC VPEGLDGYQC LCPLAFGGEA NCALKLSLEC RVDLLFLLDS SAGTTLDGFL 3 60 

RAKVFVKRFV RAVLSEDSRA RVGVATYSRE LLVAVPVGEY QDVPDLVWSL DGIPFRGGPT 42 0 

LTGSALRQAA ERGFGSATRT GQDRPRRVW LLTESHSEDE VAGPARHARA RELLLLGVGS 4 80 

EAVRAELEEI TGSPKHVMVY SDPQDLFNQI PELQGKLCSR QRPGCRTQAL DLVFMLDTSA 540 

SVGPENFAQM QSFVRSCALQ FEVNPDVTQV GLWYGSQVQ TAFGLDTKPT RAAMLRAISQ 600 

APYLGGVGSA GTALLHIYDK VMTVQRGARP GVPKAVWLT GGRGAEDAAV PAQKLRNNGI 660 

SVLWGVGPV LSEGLRRLAG PRDSLIHVAA YADLRYHQDV LIEWLCGEAK QPVNLCKPSP 720 

CMNEGSCVLQ NGSYRCKCRD GWEGPHCENR EWSSCSVCVS QGWILETPLR HMAPVQEGSS 780 
RTPPSNYREG LGTEMVPTFW NVCAPGP 



SEQ ID NO: 24 DNA sequence 

Nucleic Acid Accession #: see Table 25 & 25A for complete list 

1 11-21 31 41 51 

I I I I I I 

AGGTCGGCTG GTTATCGGGA GTTGGAGGGC TGAGGTCGGG AGGGTGGTGT GTACAGAGCT 60 

CTAGGACTCA CGCACCAGGC CAGTCGCGGG TTTTGGGCCG AGGCCTGGGT TACAAGCAGC 120 

AAGTGCGCGG TTGGGGCCAC TGCGAGGCCG TTTTAGAAAA CTGTTTAAAA CAAAGAGCAA 180 

TTGATGGATA AATCAGGAAT AGATTCTCTT GACCATGTGA CATCTGATGC TGTGGAACTT 24 0 

GCAAATCGAA GTGATAACTC TTCTGATAGC AGCTTATTTA AAACTCAGTG TATCCCTTAC 3 00 

TCACCTAAAG GGGAGAAAAG AAACCCCATT CGAAAATTTG TTCGTACACC TGAAAGTGTT 360 

CACGCAAGTA TTCATCAAGT GACTCATCTT TTGAACCAGT ACCATTGACT ATAAAAGCTA 420 

TTTTTGAAAG ATTCAAGAAC AGGAAAAAGA GATATAAAAA AAAGAAAAAG AGGAGGTACC 4 80 

AGCCAACAGG AAGACCACGG GGAAGACCAG AAGGAAGGAG AAATCCTATA TACTCACTAA 54 0 

TAGATAAGAA GAAACAATTT AGAAGCAGAG GATCTGGCTT CCCATTTTTA GAATCAGAGA 600 

ATGAAAAAAA CGCACCTTGG AGAAAAATTT TAACGTTTGA GCAAGCTGTT GCAAGAGGAT 660 

TTTTTAACTA TATTGAAAAA CTGAAGTATG AACACCACCT GAAAGAATCA TTGAAGCAAA 720 

TGAATGTTGG TGAAGATTTA GAAAATGAAG ATTTTGACAG TCGTAGATAC AAATTTTTGG 780 

ATGATGATGG ATCCATTTCT CCTATTGAGG AGTCAACGCT TTTATCTTGA GGACATGGTG 84 0 

TCTGGAGTTA AAGGTATTGG CATACTCCAC ACATCTGTAC CATTCTTGAG TGATCGCTTA 900 

GGAATGAATG TGATTTGGAC TCATTCATGT ATGAGAGTAA GCAATGCTTT TTTTTCCAGG 960 

GTGTCAAATT GAGAACCAGG TAGATCCCCA CCACCTACAG TAAAAAGGAC CCTAAAGTAA 1020 

ATTGGTTGAA GAAATTAGAT CCCAAAGATT CTTGGTGAAT TTTGAAGTCT TCATCAGTAT 1080 

ATCCATATTA AAACGAGATG ACAGAAGCCA AAGTAATTAT GGGCTGACAG GACAACTGGA 114 0 

TCAGTTTCAT TAAAAAGGGC AAACTTGAAG ATAAATCTTT TGACTCCAGC TCTTTAGAGG 12 00 

ATCTAAAGTG ACCTTGATGG ACAGTGGAAG AAATCACAAC ATGGAATTCC TCGAATAACA 1260 

ATTTATTGAC TTTAAATAAT TTTGTCTAAT GCTACATATA CACAATTAAA AAACCTTTAC 1320 

ACTATTTCTA GAAAGTCAGC ATGTATTTTT GGCTCGAAGT TTCTCTAGTG TTTTCTGTGG 1380 

AAGGAATAAA AATTTGAGGT TTCAATACAA AAACAAAACA AACAACACGA AACACGAAAA 144 0 

ACAATCTGTT GTGCGGCGCC CCTGGGCCCC TTGAGAGAAA ACTTTTTAGA ACCCCTTTTG 1500 

CGTTGTGGCG GCCCGGGGGC CCCACAGTTG GGTTTAGGTG GGCACCCTTG TGTCTACAAG 1560 

TGGTGTCTCC CCAAGAGAGA GAACACCTCC GGGGTCAAGC GGACAACAAG AGTGCGTCGT 162 0 



GAGGACTCTT CACCCAAAGT ATATAAAACC CGCCCCGCGG GGGAACCACC GGCCGCTTTT 
CTGTAGACAC AACCCCCACA GTGGGAACCT CTGAGGGCGC ACACACAGGG CGAGCCTTAT 
CAACAAGGGG TGCCCAACAG AAACCCCGAG TTAAAAATCG 



1680 
1740 



SEQ ID NO: 25 DNA sequence 

Nucleic Acid Accession #: BC001972.1 
Coding secjuence : 183-1019 

1 11 21 31 41 51 

I I I I I I 

GGTCGGCTGG TTATCGGGAG TTGGAGGGCT GAGGTCGGGA GGGTGGTGTG TACAGAGCTC 60 

TAGGACTCAC GCACCAGGCC AGTCGCGGGT TTTGGGCCGA GGCCTGGGTT ACAAGCAGCA 12 0 

AGTGCGCGGT TGGGGCCACT GCGAGGCCGT TTTAGAAAAC TGTTTAAAAC AAAGAGCAAT 180 

TGATGGATAA ATCAGGAATA GATTCTCTTG ACCATGTGAC ATCTGATGCT GTGGAACTTG 24 0 

CAAATCGAAG TGATAACTCT TCTGATAGCA GCTTATTTAA AACTCAGTGT ATCCCTTACT 3 00 

CACCTAAAGG GGAGAAAAGA AACCCCATTC GAAAATTTGT TCGTACACCT GAAAGTGTTC 3 60 

ACGCAAGTGA TTCATCAAGT GACTCATCTT TTGAACCAAT ACCATTGACT ATAAAAGCTA 42 0 

TTTTTGAAAG ATTCAAGAAC AGGAAAAAGA GATATAAAAA AAAGAAAAAG AGGAGGTACC 4 80 

AGCCAACAGG AAGACCACGG GGAAGACCAG AAGGAAGGAG AAATCCTATA TACTCACTAA 54 0 

TAGATAAGAA GAAACAATTT AGAAGCAGAG GATCTGGCTT CCCATTTTTA GAATCAGAGA 600 

ATGA7\AAAAA CGCACCTTGG AGAAAAATTT TAACGTTTGA GCAAGCTGTT GCAAGAGGAT 660 

TTTTTAACTA TATTGAAAAA CTGAAGTATG AACACCACCT GAAAGAATCA TTGAAGCAAA 72 0 

TGAATGTTGG TGAAGATTTA GAAAATGAAG ATTTTGACAG TCGTAGATAC AAATTTTTGG 780 

ATGATGATGG ATCCATTTCT CCTATTGAGG AGTCAACAGC AGAGGATGAG GATGCAACAC 84 0 

ATCTTGAAGA TAACGAATGT GATATCAAAT TGGCAGGGGA TAGTTTCATA GTAAGTTCTG 900 

AATTCCCTGT AAGACTGAGT GTATACTTAG AAGAAGAGGA TATTACTGAA GAAGCTGCTT 960 

TGTCTAAAAA GAGAGCTACA AAAGCCAAAA ATACTGGACA GAGAGGCCTG AAAATGTGAC 102 0 

AGGATCATGA ATGTCAAAGG CTTTTATCTT GAGAACATGG TGTCTGGAGT TAAAGGACTA 10 80 

TTGTTAGATC TGTGGGAAGG AATTACAAGA CAGTTGCTAA AAGTTTGAAA T^GACGGTTG 114 0 

CTAAACGTTA TGAAAAACCA GATAATCTAC TTTTTTACCT TAGGTATTGG CATACTCCAC 12 00 

ACATCTGTAC CATTCTTGAG TGATCGCTTA GGAATGAATG TGATTTGAAC TCATTCATGT 12 60 

TGAGAGGGTG TCAAATTGAG AACCAGGTAG ATCCCCACCA CCTACAGTAA AAAGGACCCT 13 2 0 

AAAGTAAATT GGTTGAAGAA ATTAGATCCC AAAGATTCTT GGTGAATTTT GAAGTCTTCA 13 80 

TCAGTATATC CATATTAAAA CGAGATGACA GAAGCCAAAG TAATTATGGG CTGACAGGAC 144 0 

AACTGGATCA GTTTCATTAA AAAGGGCAAA CTTGAAGATA AATCTTTTGA CTCCAGCTCT 15 00 

TTAGAGGATC TAAAGTGACC TTGATGGACA GTGGAAGAAA TCACAACATG -GAATTCCTCG 156 0 

AATAACAATT TATTGACTTT AAATAATTTT GTCTAATGCT ACATATACAC AATTT^AAAAA 162 0 
CCTTTACACT AAAAAAAAAA AAAAAA 



SEQ ID NO: 26 Protein sequence: 
Protein Accession #: AAH01972.1 

1 11 21 31 41 51 

I I I I I I 

MDKSGIDSLD HVTSDAVELA NRSDNSSDSS LFKTQCIPYS PKGEKRNPIR KFVRTPESVH 60 
ASDSSSDSSF EPIPLTIKAI FERFKNRKKR YKKKKKRRYQ PTGRPRGRPE GRRNPIYSLI 12 0 
DKKKQFRSRG SGFPFLESEN EKNAPWRKIL TFEQAVARGF FNYIEKLKYE HHLKESLKQM 180 
NVGEDLENED FDSRRYKFLD DDGSISPIEE STAEDEDATH LEDNECDIKL AGDSFIVSSE 240 
FPVRLSVYLE EEDITEEAAL SKKRATKAKN TGQRGLKM 



SEQ ID NO: 27 DNA sequence 

Nucleic Acid Accession #: AK027016 

Coding sequence: 2 07-1043 

1 11 21 31 41 51 

I I I I I I 

CTTTTCTTCC GCACGGTTGG AGGAGGTCGG CTGGTTATCG GGAGTTGGAG GGCTGAGGTC 60 

GGGAGGGTGG TGTGTACAGA GCTCTAGGAC TCACGCACCA GGCCAGTCGC GGATTTTGGG 12 0 



CCGAGGCCTG GGTTACAAGC AGCAAGTGCG CGGTTGGGGC CACTGCGAGG CCGTTTTAGA 180 

AAACTGTTTA AAACAAAGAG CAATTGATGG ATAAATCAGG AATAGATTCT CTTGACCATG 240 

TGACATCTGA TGCTGTGGAA CTTGCAAATC GAAGTGATAA CTCTTCTGAT AGCAGCTTAT 3 00 

TTAAAACTCA GTGTATCCCT TACTCACCTA AAGGGGAGAA AAGTU^CCCC ATTCGAAAAT 360 

TTGTTCGTAC ACCTGAAAGT GTTCACGCAA GTGATTCATC AAGTGACTCA TCTTTTGAAC 420 

CAATACCATT GACTATAAAA GCTATTTTTG AAAGATTCAA GAACAGGAAA AAGAGATATA 4 80 

AAAAAAAGAA AAAGAGGAGG TACCAGCCAA CAGGAAGACC ACGGGGAAGA CCAGAAGGAA 54 0 

GGAGAAATCC TATATACTCA CTAATAGATA AGAAGAAACA ATTTAGAAGC AGAGGATCTG 600 

GCTTCCCATT TTTAGAATCA GAGAATGAAA AAAACGCACC TTGGAGAAAA ATTTTAACGT 660 

TTGAGCAAGC TGTTGCAAGA GGATTTTTTA ACTATATTGA AAAGCTGAAG TATGAACACC 72 0 

ACCTGAAAGA ATCATTGAAG CAAATGAATG TTGGTGAAGA TTTAGAAAAT GAAGATTTTG 78 0 

ACAGTCGTAG ATACAAATTT TTGGATGATG ATGGATCCAT TTCTCCTATT GAGGAGTCAA 840 

CAGCAGAGGA TGAGGATGCA ACACATCTTG AAGATAACGA ATGTGATATC AAATTGGCAG 900 

GGGATAGTTT CATAGTAAGT TCTGAATTCC CTGTAAGACT GAGTGTATAC TTAGAAGAAG 960 

AGGATATTAC TGAAGAAGCT GCTTTGTCTA AAAAGAGAGC TACAAAAGCC AAAAATACTG 102 0 

GACAGAGAGG CCTGAAAATG TGACAGGATC ATGAATGTCA AAGGCTTTTA TCTTGAGAAC 1080 

ATGGTGTCTG GAGTTAAAGG TATTGGCATA CTCCACACAT CTGTACCATT CTTGAGTGAT 114 0 

CGCTTAGGAA TGAATGTGAT TTGAACTCAT TCATGTTGAG AGGGTGTCAA ATTGAGAACC 12 00 

AGGTAGATCC CCACCACCTA CAGTAAAAAG GACCCTAAAG TAAATTGGTT GAAGAAATTA 126 0 

GATCCCAAAG ATTCTTGGTG AATTTTGAAG TCTTCATCAG TATATCCATA TTAAAACGAG 132 0 

ATGACAGAAG CCAAAGTAAT TATGGCAAGT AATGGTTTTT ATCTTAACTA TAAGTTATTT 13 80 

GCTCAAGGGT GTAATGGTCA TTACCAAGGC TTTTAGAATG CAGTTTCTCA TTTGCTGTGG 1440 

ACATGACCAT AAAAAAAAAT TTCCCAGTAG GTTTTCTATC TGCTACGTTG CTAGCAATCA 1500 

GCTTATTGGG AACAGTTGAT TAACTGTAAT AGAAATGCAA TACAAATAAA ATGTGAACCA 1560 

CATGTGATTT TTCTTTAAAA TCAGTGAGAT TTGAAAATTC TCCTAGATCT CTTGAATCAT 1620 

GCAAATTTGC TTTGCCTTTA TATTGTAACC CTTGTGGGTT GCTAATAACC AAGCAGTTTG 1680 

TAGTAGAGTT AACTCAGGCT CGTTCTAGGG ACTCATTCAT GTTCACTCAC TGTACACTCA 174 0 

TCTCTGGAAA TGTAAAATTT ACTTTTATAC TATTGTTATG TAGGGCTGAC AGGACAACTG 18 00 

GATCAGTTTC ATTAAAAAGG TATGTATGCA TTAGAAAAGA CATTTGTATG GGTCATTTCA 18 6 0 

AAGAGGGCTT ATGAGGCTGT GAAACCCAGA GCTCTTAACG CTGTGACCAA AGATGGAAGT 192 0 

TCTCTATAGG AAGCCATAGC ACTCCTAATG TTTGGTGCTA TGTTTTCCTG AGGAGATATA 1980 

AAACGTAATA ATCCATGATT GTTGCCATGT GAGAGTTTTA AAGGTTAATC AAAATTTCTC 2 04 0 

TTCTTCAGGG CAAACTTGAA GATAAATCTT TTGACTCCAG CTCTTTAGAG GATCTAAAGT 210 0 

GACCTTGATG GACAGTGGAA GAAATCACAA CATGGAATTC CTCGAATAAC AATTTATTGA 2160 

CTTTAAATAA TTTTGTCTAA TGCTACATAT ACACAATTAA AAAACCTTTA CACTATTTCT 2 22 0 

AGAAAGTCAG CATGTATTTT TGGCTCGAAG TTTCTCTAGT GTTTTCTGTG GAAGGAATAA 22 80 
AAATTTGAGT TTCAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAA 



SEQ ID NO: 28 Protein sequence: 
Protein Accession #: BAB15628.1 

1 11 21 31 41 51 

I I I I I I 

MDKSGIDSLD HVTSDAVELA NRSDNSSDSS LFKTQCIPYS PKGEKRNPIR KFVRTPESVH 60 

ASDSSSDSSF EPIPLTIKAI FERFKNRKKR YKKKKKRRYQ PTGRPRGRPE GRRNPIYSLI 12 0 

DKKKQFRSRG SGFPFLESEN EKNAPWRKIL TFEQAVARGF FNYIEKLKYE HHLKESLKQM 180 

NVGEDLENED FDSRRYKFLD DDGSISPIEE STAEDEDATH LEDNECDIKL AGDSFIVSSE 24 0 
FPVRLSVYLE EEDITEEAAL SKKRATKAKN TGQRGLKM 



SEQ ID NO: 29 DNA sequence 

Nucleic Acid Accession #: NM_004289.3 
Coding sequence: 4 93-1695 

1 11 21 31 41 51 

I I I I i I 

GCCGCCGCCT CGTCCACCGG AGGAGCCGGC GCCAGCGTGG ACGGCGGCAG CCAGGCTGTG 60 
CAGGGGGGCG GCGGGGACCC CCGAGCGGCT CGGAGTGGCC CCTTGGACGC CGGGGAAGAG 12 0 
GAGAAGGCAC CCGCGGAACC GACGGCTCAG GTGCCGGACG CTGGCGGATG TGCGAGCGAG 180 
GAGAATGGGG TACTAAGAGA AAAGCACGAA GCTGTGGATC ATAGTTCCCA GCATGAGGAA 240 



AATGAAGAAA GGGTGTCAGC CCAGAAGGAG AACTCACTTC AGCAGAATGA TGATGATGAA 300 

AACAAAATAG CAGAGAAACC TGACTGGGAG GCAGAAAAGA CCACTGAATC TAGAAATGAG 3 60 

AGACATCTGA ATGGGACAGA TACTTCTTTC TCTCTGGAAG ACTTATTCCA GTTGCTTTCA 42 0. 

TCACAGCCTG AAAATTCACT GGAGGGCATC TCATTGGGAG ATATTCCTCT TCCAGGCAGT 4 80 

ATCAGTGATG GCATGAATTC TTCAGCACAT TATCATGTAA ACTTCAGCCA GGCTATAAGT 54 0 

CAGGATGTGA ATCTTCATGA GGCCATCTTG CTTTGTCCCA ACAATACATT TAGAAGAGAT 600 

CCAACAGCAA GGACTTCACA GTCACAAGAA CCATTTCTGC AGTTAAATTC TCATACCACC 660 

AATCCTGAGC AAACCCTTCC TGGAACTAAT TTGACAGGAT TTCTTTCACC GGTTGACAAT 72 0 

CATATGAGGA ATCTAACAAG CCAAGACCTA CTGTATGACC TTGACATAAA TATATTTGAT 7 80 

GAGATAAACT TAATGTCATT GGCCACAGAA GACAACTTTG ATCCAATCGA TGTTTCTCAG 84 0 

CTTTTTGATG AACCAGATTC TGATTCTGGC CTTTCTTTAG ATTCAAGTCA CAATAATACC 900 

TCTGTCATCA AGTCTAATTC CTCTCACTCT GTGTGTGATG AAGGTGCTAT AGGTTATTGC 960 

ACTGACCATG AATCTAGTTC CCATCATGAC TTAGAAGGTG CTGTAGGTGG CTACTACCCA 102 0 

GAACCCAGTA AGCTTTGTCA CTTGGATCAA AGTGATTCTG ATTTCCATGG AGATCTTACA 10 8 0 

TTTCAACACG TATTTCATAA CCACACTTAC CACTTACAGC CAACTGCACC AGAATCTACT 114 0 

TCTGAACCTT TTCCGTGGCC TGGGAAGTCA CAGAAGATAA GGAGTAGATA CCTTGAAGAC 12 00 

ACAGATAGAA ACTTGAGCCG TGATGAACAG CGTGCTAAAG CTTTGCATAT CCCTTTTTCT 12 60 

GTAGATGAAA TTGTCGGCAT GCCTGTTGAT TCTTTCAATA GCATGTTAAG TAGATATTAT 132 0 

CTGACAGACC TACAAGTCTC ACTTATCCGT GACATCAGAC GAAGAGGGAA AAATAAAGTT 13 80 

GCTGCGCAGA ACTGTCGTAA ACGCAAATTG GACATAATTT TGAATTTAGA AGATGATGTA 144 0 

TGTAACTTGC AAGCAAAGAA GGAAACTCTT AAGAGAGAGC AAGCACAATG TAACAAAGCT 1500 

ATTAACATAA TGAAACAGAA ACTGCATGAC CTTTATCATG ATATTTTTAG TAGATTAAGA 1560 

GATGACCAAG GTAGGCCAGT CAATCCCAAC CACTATGCTC TCCAGTGTAC CCATGATGGA 1620 

AGTATCTTGA TAGTACCCAA AGAACTGGTG GCCTCAGGCC ACAAAAAGGA AACCCAAAAG 1680 

GGAAAGAGAA AGTGAGAAGA AACTGAAGAT GGACTCTATT ATGTGAAGTA GTAATGTTCA 174 0 

GAAACTGATT ATTTGGATCA GAAACCATTG AAACTGCTTC AAGAATTGTA TCTTTAAGTA 18 00 

CTGCTACTTG AATAACTCAG TTAACGCTGT TTTGAAGCTT ACATGGACAA ATGTTTAGGA 1860 

CTTCAAGATC ACACTTGTGG GCAATCTGGG GGAGCCACAA CTTTTCATGA AGTGCATTGT 1920 

ATACAAAATT CATAGTTATG TCCAAAGAAT AGGTTAACAT GAAAACCCAG TAAGACTTTC 198 0 

CATCTTGGCA GCCATCCTTT TTAAGAGTAA GTTGGTTACT TCAAAAAGAG CAAACACTGG 2 04 0 

GGATCAAATT ATTTTAAGAG GTATTTCAGT TTTAAATGCA AAATAGCCTT ATTTTCATTT 2100 

AGTTTGTTAG CACTATAGTG AGCTTTTCAA ACACTATTTT AATCTTTATA TTTAACTTAT 2160 
AAATTTTGCT TTCT 



SEQ ID NO: 30 Protein sequence: 
Protein Accession #: NP_004280 

1 11 21 31 41 51 

I I I I I I 

MNSSAHYHVN FSQAISQDVN LHEAILLCPN NTFRRDPTAR TSQSQEPFLQ LNSHTTNPEQ 60 

TLPGTNLTGF LSPVDNHMRN LTSQDLLYDL DINIFDEINL MSLATEDNFD PIDVSQLFDE 120 

PDSDSGLSLD SSHNNTSVIK SNSSHSVCDE GAIGYCTDHE SSSHHDLEGA VGGYYPEPSK 180 

LCHLDQSDSD FHGDLTFQHV FHNHTYHLQP TAPESTSEPF PWPGKSQKIR SRYLEDTDRN 240 
LSRDEQRAKA LHIPFSVDEI VGMPVDSFNS- MLSRYYLTDL QVSLIRDIRR RGKNKVAAQN 3 00 

CRKRKLDIIL NLEDDVCNLQ AKKETLKREQ AQCNKAINIM KQKLHDLYHD IFSRLRDDQG 360 

RPVNPNHYAL QCTHDGSILI VPKELVASGH KKETQKGKRK 



SEQ ID NO: 31 DNA sequence 

Nucleic Acid Accession #: NM_033260.1 
Coding sequence: 1-1208 

1 11 21 31 41 51 

I I I I I I 

ATGAAGTTGG AGGTGTTCGT CCCTCGCGCG GCCCACGGGG ACAAGCAGGG CAGTGACCTG 60 
GAGGGCGCGG GCGGCAGCGA CGCGCCGTCC CCGCTGTCGG CGGCGGGAGA CGACTCCCTG 120 
GGCTCAGATG GGGACTGCGC GGCCAAGCCG TCCGCGGGCG GCGGCGCCAG AGATACGCAG 180 
GGCGACGGCG AACAGAGTGC GGGAGGCGGG CCGGGCGCGG AGGAGGCGAT CCCGGCAGCA 24 0 
GCTGCTGCAG CGGTGGTGGC GGAGGGCGCG GAGGCCGGGG CGGCGGGGCC AGGCGCGGGC 3 00 
GGCGCGGGGA GCGGCGAGGG TGCACGCAGC AAGCCATATA CGCGGCGGCC CAAGCCCCCC 360 



TACTCGTACA TCGCGCTCAT CGCCATGGCC ATCCGCGACT CGGCGGGCGG GCGCTTGACG 42 0 

CTGGCGGAGA TCAACGAGTA CCTCATGGGC AAGTTCCCCT TTTTCCGCGG CAGCTACACG 4 80 

GGCTGGCGCA ACTCCGTGCG CCACAACCTT TCGCTCAACG ACTGCTTCGT CAAGGTGCTG 54 0 

CGCGACCCCT CGCGGCCCTG GGGCAAGGAC AACTACTGGA TGCTCAACCC CAACAGCGAG 600 

TACACCTTCG CCGACGGGGT CTTCCGCCGC CGCCGCAAGC GCCTCAGCCA CCGCGCGCCG 660 

GTCCCCGCGC CCGGGCTGCG GCCCGAGGAG GCCCCGGGCC TCCCCGCCGC CCCGCCGCCC 72 0 

GCGCCCGCCG CCCCGGCCTC GCCCCGCATG CGCTCGCCCG CCCGCCAGGA GGAGCGCGCC 780 

AGCCCCGCGG GCAAGTTCTC CAGCTCCTTC GCCATCGACA GCATCCTGCG CAAGCCCTTC 84 0 

CGCAGCCGTC GCCTCAGGGA CACGGCCCCC GGGACGACGC TTCAGTGGGG CGCCGCGCCC 90 0 

TGCCCGCCGC TGCCCGCGTT CCCCGCGCTC CTCCCCGCGG CGCCCTGCAG GGCCCTGCTG 960 

CCGCTCTGCG CGTACGGCGC GGGCGAGCCG GCGCGGCTGG GCGCGCGCGA GGCCGAGGTG 102 0 

CCACCGACCG CGCCGCCCCT CCTGCTTGCA CCTCTCCCGG CGGCGGCCCC CGCCAAGCCA 1080 

CTCCGAGGCC CGGCGGGCGG CGGCGCGCAC CTGTACTGCC CCCTGCGGCT GCCCGCAGCC 114 0 

CTGCAGGCGG CCTTAGTCCG NCGTCCTGGC CCGCACCTGT CGTACCCGGT GGAGACGCTC 12 00 
CTAGCTTGA 



SEQ ID NO: 32 Protein sequence: 
Protein Accession #: NP_150285.1 

1 11 21 31 41 51 

I I I I I I 

MKLEVFVPRA AHGDKQGSDL EGAGGSDAPS PLSAAGDDSL GSDGDCAAKP SAGGGARDTQ 60 

GDGEQSAGGG PGAEEAIPAA AAAAWAEGA EAGAAGPGAG GAGSGEGARS KPYTRRPKPP 12 0 

YSYIALIAMA IRDSAGGRLT LAEINEYLMG KFPFFRGSYT GWRNSVRHNL SLNDCFVKVL 180 

RDPSRPWGKD NYWMLNPNSE YTFADGVFRR RRKRLSHRAP VPAPGLRPEE APGLPAAPPP 24 0 

APAAPASPRM RSPARQEERA SPAGKFSSSF AIDSILRKPF RSRRLRDTAP GTTLQWGAAP 3 00 

CPPLPAFPAL LPAAPCRALL PLCAYGAGEP ARLGAREAEV PPTAPPLLLA PLPAAAPAKP 360 

LRGPAAGGAH LYCPLRLPAA LQAALVRRPG PHLSYPVETL LA 



SEQ ID NO: 33 DNA sequence 

Nucleic Acid Accession #: NM_012128.2 
Coding sequence: 43-2796 

1 11 21 31 41 51 

I I I I I I 

GAACAAACCA ACATTTGAGC CAGGAATAAC TAGAGAGGAA CAATGGGGTT ATTCAGAGGT 60 

TTTGTTTTCC TCTTAGTTCT GTGCCTGCTG CACCAGTCAA ATACTTCCTT CATTAAGCTG 12 0 

AATAATAATG GCTTTGAAGA TATTGTCATT GTTATAGATC CTAGTGTGCC AGAAGATGAA 180 

AAAATAATTG AACAAATAGA GGATATGGTG ACTACAGCTT CTACGTACCT GTTTGAAGCC 24 0 

ACAGAAAAAA GATTTTTTTT CAAAAATGTA TCTATATTAA TTCCTGAGAA TTGGAAGGAA 3 00 

AATCCTCAGT ACAAAAGGCC AAAACATGAA AACCATAAAC ATGCTGATGT TATAGTTGCA 3 60 

CCACCTACAC TCCCAGGTAG AGATGAACCA TACACCAAGC AGTTCACAGA ATGTGGAGAG 42 0 

AAAGGCGAAT ACATTCACTT CACCCCTGAC CTTCTACTTG GAAAAAAACA AAATGAATAT 4 80 

GGACCACCAG GCAAACTGTT TGTCCATGAG TGGGCTCACC TCCGGTGGGG AGTGTTTGAT 54 0 

GAGTACAATG AAGATCAGCC TTTCTACCGT GCTAAGTCAA AAAAAATCGA AGCAACAAGG 600 

TGTTCCGCAG GTATCTCTGG TAGAAATAGA GTTTATAAGT GTCAAGGAGG CAGCTGTCTT 660 

AGTAGAGCAT GCAGAATTGA TTCTACAACA AAACTGTATG GAAAAGATTG TCAATTCTTT 72 0 

CCTGATAAAG TACAAACAGA AAAAGCATCC ATAATGTTTA TGCAAAGTAT TGATTCTGTT 7 80 

GTTGAATTTT GTAACGAAAA AACCCATAAT CAAGAAGCTC CAAGCCTACA AAACATAAAG 840 

TGCAATTTTA GAAGTACATG GGAGGTGATT AGCAATTCTG AGGATTTTAA AAACACCATA 900 

CCCATGGTGA CACCACCTCC TCCACCTGTC TTCTCATTGC TGAAGATCCG TCAAAGAATT 960 

GTGTGCTTAG TTCTTGATAA GTCTGGAAGC ATGGGGGGTA AGGACCGCCT AAATCGAATG 102 0 

AATCAAGCAG CAAAACATTT CCTGCTGCAG ACTGTTGAaA ATGGATCCTG GGTGGGGATG 10 80 

GTTCACTTTG ATAGTACTGC CACTATTGTA AATAAGCTAA TCCAAATAAA AAGCAGTGAT 1140 

GAAAGAAACA CACTCATGGC AGGATTACCT ACATATCCTC TGGGAGGAAC TTCCATCTGC 12 00 

TCTGGAATTA AATATGCATT TCAGGTGATT GGAGAGCTAC ATTCCCAACT CGATGGATCC 12 60 

GAAGTACTGC TGCTGACTGA TGGGGAGGAT AACACTGCAA GTTCTTGTAT TGATGAAGTG 132 0 

AAACAAAGTG GGGCCATTGT TCATTTTATT GCTTTGGGAA GAGCTGCTGA TGAAGCAGTA 13 80 

ATAGAGATGA GCAAGATAAC AGGAGGAAGT CATTTTTATG TTTCAGATGA AGCTCAGAAC 144 0 



AATGGCCTCA TTGATGCTTT TGGGGCTCTT ACATCAGGAA ATACTGATCT CTCCCAGAAG 1500 

TCCCTTCAGC TCGAAAGTAA GGGATTAACA CTGAATAGTA ATGCCTGGAT GAACGACACT 1560 

GTCATAATTG ATAGTACAGT GGGAAAGGAC ACGTTCTTTC TCATCACATG GAACAGTCTG 162 0 

CCTCCCAGTA TTTCTCTCTG GGATCCCAGT GGAACAATAA TGGAAAATTT CACAGTGGAT 1680 

GCAACTTCCA AAATGGCCTA TCTCAGTATT CCAGGAACTG CAAAGGTGGG CACTTGGGCA 174 0 

TACAATCTTC AAGCCAAAGC GAACCCAGAA ACATTAACTA TTACAGTAAC TTCTCGAGCA 1800 

GCAAATTCTT CTGTGCCTCC AATCACAGTG AATGCTAAAA TGAATAAGGA CGTAAACAGT 1860 

TTCCCCAGCC CAATGATTGT TTACGCAGAA ATTCTACAAG GATATGTACC TGTTCTTGGA 192 0 

GCCAATGTGA CTGCTTTCAT TGAATCACAG AATGGACATA CAGAAGTTTT GGAACTTTTG 1980 

GATAATGGTG CAGGCGCTGA TTCTTTCAAG AATGATGGAG TCTACTCCAG GTATTTTACA 2 04 0 

GCATATACAG AAAATGGCAG ATATAGCTTA AAAGTTCGGG CTCATGGAGG AGCAAACACT 2100 

GCCAGGCTAA AATTACGGCC TCCACTGAAT AGAGCCGCGT ACATACCAGG CTGGGTAGTG 2160 

AACGGGGAAA TTGAAGCAAA CCCGCCAAGA CCTGAAATTG ATGAGGATAC TCAGACCACC 222 0 

TTGGAGGATT TCAGCCGAAC AGCATCCGGA GGTGCATTTG TGGTATCACA AGTCCCAAGC 228 0 

CTTCCCTTGC CTGACCAATA CCCACCAAGT CAAATCACAG ACCTTGATGC CACAGTTCAT 2 34 0 

GAGGATAAGA TTATTCTTAC ATGGACAGCA CCAGGAGATA ATTTTGATGT TGGAAAAGTT 24 00 

CAACGTTATA TCATAAGAAT AAGTGCAAGT ATTCTTGATC TAAGAGACAG TTTTGATGAT 2460 

GCTCTTCAAG TAAATACTAC TGATCTGTCA CCAAAGGAGG CCAACTCCAA GGAAAGCTTT 2 52 0 

GCATTTAAAC CAGAAAATAT CTCAGAAGAA AATGCAACCC ACATATTTAT TGCCATTAAA 2 580 

AGTATAGATA AAAGCAATTT GACATCAAAA GTATCCAACA TTGCACAAGT AACTTTGTTT 2 64 0 

ATCCCTCAAG CAAATCCTGA TGACATTGAT CCTACTCCTA CTCCTACTCC TACTCCTGAT 2 700 

AAAAGTCATA ATTCTGGAGT TAATATTTCT ACGCTGGTAT TGTCTGTGAT TGGGTCTGTT 2760 

GTAATTGTTA ACTTTATTTT AAGTACCACC ATTTGAACCT TAACGAAGAA AAAAATCTTC 2 82 0 

AAGTAGACCT AGAAGAGAGT TTTAAAAAAC AAAACAATGT AAGTAAAGGA TATTTCTGAA 2 88 0 

TCTTAAAATT CATCCCATGT GTGATCATAA ACTCATAAAA ATAATTTTAA GATGTCGGAA 2 940 

AAGGATACTT TGATTAAATA AAAACACTCA TGGATATGTA AAAACTGTCA AGATTAAAAT 3 000 

TTAATAGTTT CATTTATTTG TTATTTTATT TGTAAGAAAT AGTGATGAAC AAAGATCCTT 3 060 

TTTCATACTG ATACCTGGTT GTATATTATT TGATGCAACA GTTTTCTGAA ATGATATTTC 312 0 

AAATTGCATC AAGAAATTAA AATCATCTAT CTGAGTAGTC AAAATACAAG TAAAGGAGAG 3180 
CAAATAAACA ACATTTGGAA AAAAAAAAAA AAAAAAAA 



SEQ ID NO: 34 Protein sequence: 
Protein Accession #: NP_036260.1 

1 11 21 31 41 51 

I I I I I I 

MGLFRGFVFL LVLCLLHQSN TSFIKLNNNG FEDIVIVIDP SVPEDEKIIE QIEDMVTTAS 60 

TYLFEATEKR FFFKNVSILI PENWKENPQY KRPKHENHKH ADVIVAPPTL PGRDEPYTKQ 12 0 

FTECGEKGEY IHFTPDLLLG KKQNEYGPPG KLFVHEWAHL RWGVFDEYNE DQPFYRAKSK 180 

KIEATRCSAG ISGRNRVYKC QGGSCLSRAC RIDSTTKLYG KDCQFFPDKV QTEKASIMFM 24 0 

QSIDSWEFC NEKTHNQEAP SLQNIKCNFR STWEVISNSE DFKNTIPMVT PPPPPVFSLL 300 

KIRQRIVCLV LDKSGSMGGK DRLNRMNQAA KHFLLQTVEN GSWVGMVHFD STATIVNKLI 360 

QIKSSDERNT LMAGLPTYPL GGTSICSGIK YAFQVIGELH SQLDGSEVLL LTDGEDNTAS 42 0 

SCIDEVKQSG AIVHFIALGR AADEAVIEMS KITGGSHFYV SDEAQNNGLI DAFGALTSGN 4 80 

TDLSQKSLQL ESKGLTLNSN AWMNDTVIID STVGKDTFFL ITWNSLPPSI SLWDPSGTIM 540 

ENFTVDATSK MAYLSIPGTA KVGTWAYNLQ AKANPETLTI TVTSRAANSS VPPITVNAKM 600 

NKDVNSFPSP MIVYAEILQG YVPVLGANVT AFIESQNGHT EVLELLDNGA GADSFKNDGV 660 

YSRYFTAYTE NGRYSLKVRA HGGANTARLK LRPPLNRAAY IPGWWNGEI EANPPRPEID 72 0 

EDTQTTLEDF SRTASGGAFV VSQVPSLPLP DQYPPSQITD LDATVHEDKI ILTWTAPGDN 780 

FDVGKVQRYI IRISASILDL RDSFDDALQV NTTDLSPKEA NSKESFAFKP ENISEENATH 840 

IFIAIKSIDK SNLTSKVSNI AQVTLFIPQA NPDDIDPTPT PTPTPDKSHN SGVNISTLVL 900 
SVIGSWIVN FILSTTI 



SEQ ID NO: 35 DNA sequence 
Nucleic Acid Accession #: 
Coding secjuence : 217-3171 



NM 000901.1 



1 11 21 31 41 51 

I I I I I I 

CGCGGGAGCC AACTTCAGGC TGCTCAGAGG AAGCCCGTGC AGTCAGTCAC CTGGGTGCAA 60 

GAGCGTTGCT GCCTCGGGCT CTCCCGCTGC AGGGAGAGCG GCACTCGCTG GCCTGGATGT 12 0 

GGTTGGATTT AGGGGGGCTC CGCAGCAGGG GTTTCGTGGC GGTGGCAAGC GCTGCAACAG 180 

GTAGACGGCG AGAGACGGAC CCCGGCCGAG GCAGGGATGG AGACCAAAGG CTACCACAGT 24 0 

CTCCCTGAAG GTCTAGATAT GGAAAGACGG TGGGGTCAAG TTTCTCAGGC TGTGGAGCGT 300 

TCTTCCCTGG GACCTACAGA GAGGACCGAT GAGAATAACT ACATGGAGAT TGTCAACGTA 3 60 

AGCTGTGTTT CCGGTGCTAT TCCAAACAAC AGTACTCAAG GAAGCAGCAA AGAAAAACAA 42 0 

GAACTACTCC CTTGCCTTCA GCAAGACAAT AATCGGCCTG GGATTTTAAC ATCTGATATT 4 80 

AAAACTGAGC TGGAATCTAA GGAACTTTCA GCAACTGTAG CTGAGTCCAT GGGTTTATAT 54 0 

ATGGATTCTG TAAGAGATGC TGACTATTCC TATGAGCAGC AGAACCAACA AGGAAGCATG 600 

AGTCCAGCTA AGATTTATCA GAATGTTGAA CAGCTGGTGA AATTTTACAA AGGAT^TGGC 660 

CATCGTCCTT CCACTCTAAG TTGTGTGAAC ACGCCCTTGA GATCATTTAT GTCTGACTCT 720 

GGGAGCTCCG TGAATGGTGG CGTCATGCGC GCCATTGTTA AAAGCCCTAT CATGTGTCAT 780 

GAGAAAAGCC CGTCTGTTTG CAGCCCTCTG AACATGACAT CTTCGGTTTG CAGCCCTGCT 84 0 

GGAATCAACT CTGTGTCCTC CACCACAGCC AGCTTTGGCA GTTTTCCAGT GCACAGCCCA 900 

ATCACCCAGG GAACTCCTCT GACATGCTCC CCTAATGCTG AAAATCGAGG CTCCAGGTCG 960 

CACAGCCCTG CACATGCTAG CAATGTGGGC TCTCCTCTCT CAAGTCCGTT AAGTAGCATG 102 0 

AAATCCTCAA TTTCCAGCCC TCCAAGTCAC TGCAGTGTAA AATCTCCAGT CTCCAGTCCC 1080 

AATAATGTCA CTCTGAGATC CTCTGTGTCT AGCCCTGCAA ATATTAACAA CTCAAGGTGC 114 0 

TCTGTTTCCA GCCCTTCGAA CACTAATAAC AGATCCACGC TTTCCAGTCC GGCAGCCAGT 1200 

ACTGTGGGAT CTATCTGTAG CCCTGTAAAC AATGCCTTCA GCTACACTGC TTCTGGCACC 1260 

TCTGCTGGAT CCAGTACATT GCGGGATGTG GTTCCCAGTC CAGACACGCA GGAGAAAGGT 1320 

GCTCAAGAGG TCCCTTTTCC TAAGACTGAG GAAGTAGAGA GTGCCATCTC AAATGGTGTG 1380 

ACTGGCCAGC TTAATATTGT CCAGTACATA AAACCAGAAC CAGATGGAGC TTTTAGCAGC 1440 

TCATGTCTAG GAGGAAATAG CAAAATT^AAT TCGGATTCTT CATTCTCAGT ACCAATAAAG 1500 

CAAGAATCAA CCAAGCATTC ATGTTCAGGC ACCTCTTTTA AAGGGAATCC AACAGTAAAC 1560 

CCGTTTCCAT TTATGGATGG CTCGTATTTT TCCTTTATGG ATGATAAAGA CTATTATTCC 162 0 

CTATCAGGAA TTTTAGGACC ACCTGTGCCC GGCTTTGATG GTAACTGTGA AGGCAGCGGA 1680 

TTCCCAGTGG GTATTAAACA AGAACCAGAT GACGGGAGCT ATTACCCAGA GGCCAGCATC 1740 

CCTTCCTCTG CTATTGTTGG GGTGAATTCA GGTGGACAGT CCTTCCACTA CAGGATTGGT 1800 

GCTCAAGGTA CAATATCTTT ATCACGATCG GCTAGAGACC AATCTTTCCA ACACCTGAGT 1860 

TCCTTTCCTC CTGTCAATAC TTTAGTGGAG TCATGGAAAT CACACGGCGA CCTGTCGTCT 192 0 

AGAAGAAGTG ATGGGTATCC GGTCTTAGAA TACATTCCAG AAAATGTATC AAGCTCTACT 1980 

TTACGAAGTG TTTCTACTGG ATCTTCAAGA CCTTCAAAAA TATGTTTGGT GTGTGGGGAT 2 04 0 

GAGGCTTCAG GATGCCATTA TGGGGTAGTC ACCTGTGGCA GCTGCAAAGT TTTCTTCAAA 2100 

AGAGCAGTGG AAGGGCAACA CAACTATTTA TGTGCTGGAA GAAATGATTG CATCATTGAT 2160 

AAGATTCGAC GAAAGAATTG TCCTGCTTGC AGACTTCAGA AATGTCTTCA AGCTGGAATG 2220 

AATTTAGGAG CACGAAAGTC AAAGAAGTTG GGAAAGTTAA AAGGGATTCA CGAGGAGCAG 22 80 

CCACAGCAGC AGCAGCCCCC ACCCCCACCC CCACCCCCGC AAAGCCCAGA GGAAGGGACA 2 34 0 

ACGTACATCG CTCCTGCAAA AGAACCCTCG GTCAACACAG CACTGGTTCC TCAGCTCTCC 2400 

ACAATCTCAC GAGCGCTCAC ACCTTCCCCC GTTATGGTCC TTGAAAACAT TGAACCTGAA 24 60 

ATTGTATATG CAGGCTATGA CAGCTCAAAA CCAGATACAG CCGAAAATCT GCTCTCCACG 2 520 

CTCAACCGCT TAGCAGGCAA ACAGATGATC CAAGTCGTGA AGTGGGCAAA GGTACTTCCA 2580 

GGATTTAAAA ACTTGCCTCT TGAGGACCAA ATTACCCTAA TCCAGTATTC TTGGATGTGT 2640 

CTATCATCAT TTGCCTTGAG CTGGAGATCG TACAAACATA CGAACAGCCA ATTTCTCTAT 2700 

TTTGCACCAG ACCTAGTCTT TAATGAAGAG AAGATGCATC AGTCTGCCAT GTATGAACTA 2 760 

TGCCAGGGGA TGCACCAAAT GAGGCTTCAG TTCGTTCGAC TGCAGCTCAC CTTTGAAGAA 2 820 

TACACCATCA TGAAAGTTTT GCTGCTACTA AGCACAATTC CAAAGGATGG CCTCAAAAGC 2 880 

CAGGCTGCAT TTGAAGAAAT GAGGACAAAT TACATCAAAG AACTGAGGAA GATGGTAACT 2940 

AAGTGTCCCA ACAATTCTGG GCAGAGCTGG CAGAGGTTCT ACCAACTGAC CAAGCTGCTG 3000 

GACTCCATGC ATGACCTGGT GAGCGACCTG CTGGAATTCT GCTTCTACAC CTTCCGAGAG 3060 

TCCCATGCGC TGAAGGTAGA GTTCCCCGCA ATGCTGGTGG AGATCATCAG CGACCAGCTG 312 0 

CCCAAGGTGG AGTCGGGGAA CGCCAAGCCG CTCTACTTCC ACCGGAAGTG ACTGCCCGCT 3180 

GCCCAGAAGA ACTTTGCCTT AAGTTTCCCT GTGTTGTTCC ACACCCAGAA GGACCCAAGA 3240 

AAACCTGTTT TTAACATGTG ATGGTTGATT CACACTTGTT CAACAGTTTC TCAAGTTTAA 3300 



AGTCATGTCA GAGGTTTGGA GCCGGGAAAG 
GCAGTCTGAA GGATTCCCCA CCTCCAATCC 
CGGGATGAAA AGCCATATCT AGTCAATAAC 
AGTTTTAACT ATGCCGTGTA GTTTCTGGTA 
CTAACGAACG TTTTAAAGCT TACCCTTGGT 
CATTAATATT CTTTTGTTAT T7UVAAAAACA 
TCCTGTTGTG TAATAACAGA ACTCGTGGCG 
CGTCCACGGT CTTCTGCATC ACTGGTATAC 
TAGAATGGAT AAGATGATGT TAAATGCCTT 
TTTACAAATA ATTTTTTGCA GTCTTTTGCT 
GAAGGACCTG AAATAAGGTA TAAGGATCTT 
TTAATGGTTA ATAGGATGAA TGGGAAAGTA 
TATTTAAGTA CTATCTTTGT CTAAACAAGG 
CATGCATAAT GAACCGTTTA CAGTGTATTT 
TCATATCTAA TTTACCATTA TTTTACAGTC 
TCGGTTGTCT TGAAACCTTT CTACTTTTTC 
CTTGGGGCAT TTGTGGGACT CCAGCACGTT 
GGGAATGACA CACTGCGGTG TCTGCAGCTC 
GAGAGGAGGT CCTTGGAGTG CTGCAGCTTT 
TAGAAGCAGA ACCAGCTCTA CGAGAGTGAG 
TTTCCTCCCA TTTTAGAAAA TCAGATTTTC 
TCTCTCTGTT AAAGATCAGC TATTCCCTTC 
AACCGGTCAC AGGAACGCAC AGATCATGGC 
GGTTCAGTCT GAATGGCATG GAGACCGGGA 
GGTTTTAAAA ATAAGTTTTA TACATAAACA 
CAAGACAGTG GCACTAAAAT GTTTAATTCA 
TATGGCTTTT CTCTTGCCCC AAATCACAAA 
GATTGCACTA AATAAACTAC TTTGAATAGA 
ATCATCAGGT TTACTCAGTG AAATCATATT 
TGGAGAAGCC CATGGCCACG AGGAAGCAGC 
TTCACCATGC AGGCAACATT ACCTTGTCTT 
AACCTGTGAA AGCTGCACTC AGAGGGAGGA 
ATTTAATTTA TAGGCAGATC TTTAAATACA 
CCACACTTTG AAGGTAATAA ATACACAGCA 
GCTTACTTAC AAAAGCAGCT TTTAGTTCAG 
TTACTTAACC AGGTTTGGGA TGGAGATGGT 
CTGGCTCTTT GTGTGTCTCC AGGTAACTTT 
ATTCACTGTT GAATGCCTGT CCCAGTCCAA 
ATTGCTCTTA AAAATCTTGG TTTGGTACAG 
TAAAGAAACA CTAAATTAGT TTAAAATGAA 
TCTGTCTTTG CAAAGGACTG TAAGCAGATT 



CTGTTTTTCC GTGGATTTGG CGAGACCAGA 
CCCAGCGCTT AGAAACATGT TCCTGTTCCT 
TCTGATTTTG ATATTTTCAC AGATGGAAGA 
TCGTTCGCTT GTTTTAAAAG GGTTCAAGGA 
TTGCACATAA AACGTATAGT CAATATGGGG 
CAAAAAAATA ATAAAAAAAT ATATACAGAT 
TGGGGCAGCA GCTGCCTCTG AGCCCTCGCT 
ACACTCGTTA GCGTCCATTT CTTATTTAAT 
GGTTTGATTT CTAGTATCTA TTGTGTTGGC 
GTGCTGTACA TTACTGTATG TATAAATTAT 
TTGTAAATGA GACACATACA AAAAAAATCT 
TTTTTGAAAG AATTCTATTT TGCTGGAGAC 
TAATTTTTTT TTGTAAAGTG CAATGTCCTG 
AAGAAAGGGA AAGCTGTGCC TTTTTTAGCT 
TCTGTTGTAA ATAACCACAC TGAAACCTCT 
TGTACTTTTT GTTTTGTTCT TGGTCTCCCG 
TTCTGGCTTC TGCTTCATCC TGCTCCATCG 
CTGGAAGGTG TCATTTGACA ACACATGTGG 
GGGAAAGCCT GCCTCGTTTC CCTTTTCCTC 
ACTGGGAACT TGATGGCTCA GAGAGCATCT 
TCCTGTGGGA AAAAAAAATT CCATGCACTC 
TGATCTTGGA AAGAGGTTCT GCACTCCTGG 
AGGATGCGCT GGGACGGCCC ATCTTGGCAA 
GATAGAGGGG TTTTAGATTT TTAAAAGGTA 
GTTTTGGAGA AAAATTACAG ATCATATAAG 
TTAATCTGTT TGTTTGGCAC TGATGCAATG 
CATATGTATC TTTGGGGAAA CTAACAATAT 
GGCCAAATTA ATCTTTTAAA AATGATGATA 
AATTATTTTC CAAAATCTAA AAGCTGTAGC 
AATTAATTAG ATCAACACTT TTCTCCAGGG 
TCAAAAGACA CCTGCCTTAG TGCAAGGGGA 
GTCTTTCTTA CATAATTTGC AATTTCAGGA 
GTCAACTTAC GGTGCACAGT AATATGAAAG 
TGCAGACTGG GAGTTGCTAG CAAACAAATG 
ACTTAGTTTT TATAAAATGA GAATTCTGAC 
CTGCATCAGC TTTTTGTATT AACAAAGTTA 
GCTTGATTAA ACAGCAAAGC CATATTCTAA 
ATTGTCTGTC TGCTCTTATT TTTGTACCAT 
TTCATAATTC ACCAAAAAGT TCATATAATT 
GCAATTTATA TCTTTATGCA AAAACATATG 
ACAATAAATC CTTTACTTT 



SEQ ID NO: 36 Protein sequence: 
Protein Accession #: NP 000892.1 



I 

METKGYHSLP 
QGSSKEKQEL 
QQNQQGSMSP 
VKSPIMCHEK 
AENRGSRSHS 
ANINNSRCSV 
SPDTQEKGAQ 
SSFSVPIKQE 
DGNCEGSGFP 
DQSFQHLSSF 
KICLVCGDEA 
QKCLQAGMNL 
TALVPQLSTI 



11 
I 

EGLDMERRWG 
LPCLQQDNNR 
AKIYQNVEQL 
SPSVCSPLNM 
PAHASNVGSP 
SSPSNTNNRS 
EVPFPKTEEV 
STKHSCSGTS 
VGIKQEPDDG 
PPVNTLVESW 
SGCHYGWTC 
GARKSKKLGK 
SRALTPSPVM 



21 

I 

QVSQAVERSS 
PGILTSDIKT 
VKFYKGNGHR 
TSSVCSPAGI 
LSSPLSSMKS 
TLSSPAASTV 
ESAISNGVTG 
FKGNPTVNPF 
SYYPEASIPS 
KSHGDLSSRR 
GSCKVFFKRA 
LKGIHEEQPQ 
VLENIEPEIV 



31 

I 

LGPTERTDEN 
ELESKELSAT 
PSTLSCVNTP 
NSVSSTTASF 
SISSPPSHCS 
GSICSPVNNA 
QLNIVQYIKP 
PFMDGSYFSF 
SAIVGVNSGG 
SDGYPVLEYI 
VEGQHNYLCA 
QQQPPPPPPP 
YAGYDSSKPD 



41 

I 

NYMEIVNVSC 
VAESMGLYMD 
LRSFMSDSGS 
GSFPVHSPIT 
VKSPVSSPNN 
FSYTASGTSA 
EPDGAFSSSC 
MDDKDYYSLS 
QSFHYRIGAQ 
PENVSSSTLR 
GRNDCIIDKI 
PQSPEEGTTY 
TAENLIiSTIiN 



51 

I 

VSGAIPNNST 
SVRDADYSYE 
SVNGGVMRAI 
QGTPLTCSPN 
VTLRSSVSSP 
GSSTLRDWP 
LGGNSKINSD 
GILGPPVPGF 
GTISLSRSAR 
SVSTGSSRPS 
RRKNCPACRL 
lAPAKEPSVN 
RLAGKQMIQV 



3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
5460 
5520 
5580 
5640 
5700 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 



VKWAKVLPGF KNLPLEDQIT LIQYSWMCLS SFALSWRSYK HTNSQFLYFA PDLVFNEEKM 840 

HQSAMYELCQ GMHQISLQFV RLQLTFEEYT IMKVLLLLST IPKDGLKSQA AFEEMRTNYI 900 

KELRKMVTKC PNNSGQSWQR FYQLTKLLDS MHDLVSDLLE FCFYTFRESH ALKVEFPAML 960 
VEIISDQLPK VESGNAKPLY FHRK 



SEQ ID NO: 37 DNA sequence 

Nucleic Acid Accession #: see Table 25 & 25A for complete list 

1 11 21 31 41 51 

I I I I I I 

CCTACCAGGT TCAAGCAACT CTGCTGCCTC AGCTCCCAAG TAGCTGGGAT TACAGGTGCA 60 

TGCCACTACA CCTGGCTTTT TGTATTTTTA GTAGAGATGG TTTTCACTAT GTTGGCCAGG 12 0 

CTGATCTTGA ATTCCTGGCC TGAAGTAATC TGCCTGCCTC AGCCTCCCAA AGTGCTGGGA 180 

TTATAGGAGC CACCACACCT GGCATAACTG GTATTTTTTA TATGCTTCCT GGGCAACTTA 24 0 

AAAAATTGAT TACTCTGTTG TTTCTTCCTT ttTTTTTTTT TTTTGGCTTT GACCAATTTG 3 00 

TGAGACCCAA GTATCTCCTA CCTAGAAAAA AAACACACTA AACAGTAAAT GATTACCAAC 360 

CTATTTGGAA CAAATCTCAA TTAATTAACA TATACTTCAA GGAGAAGACT TAACAAAATC 42 0 

TTACTTTTCA TTCTTAATAG CTCTTTCCAT AAAAATGTTC CACAAGTGTA TCAAATTAGT 480 

CCTAACAACT ACTGTTAAGT GATTAATGAA ACAGGAGTGA CAGGAGTGAA TTTAATAATA 54 0 

GCAATAAATA CAGATGGGAC TACATAAATT GTGGAGGTCC TGATGCAAAA CTCTCTCTGT 600 

ATTCGATGGC ATCTCAGCTT TCTCATAGAG CTGTTTCACT GTGAGGGTCT TTATCCTTCA 660 

TGCAGAGCTT CATTATTTTC TTTCTTCTAG CAATCAGTCC AAAGCACAAT GTCAGAAAGA 72 0 

TCACAACACA TGCAGCAATA ATGGGCTCTA TTGGTACACC CACAGTTTTA TCTTTAACAA 780 
TC 



SEQ ID NO: 38 DNA sequence 

Nucleic Acid Accession #: NM_001192.1 
Coding sequence: 219-773 

1 11 21 31 41 51 

I I I I I I 

AAGACTCAAA CTTAGAAACT TGAATTAGAT GTGGTATTCA AATCCTTACG TGCCGCGAAG 60 

ACACAGACAG CCCCCGTAAG AACCCACGAA GCAGGCGAAG TTCATTGTTC TCAACATTCT 12 0 

AGCTGCTCTT GCTGCATTTG CTCTGGAATT CTTGTAGAGA TATTACTTGT CCTTCCAGGC 180 

TGTTCTTTCT GTAGCTCCCT TGTTTTCTTT TTGTGATCAT GTTGCAGATG GCTGGGCAGT 24 0 

GCTCCCAAAA TGAATATTTT GACAGTTTGT TGCATGCTTG CATACCTTGT CAACTTCGAT 3 00 

GTTCTTCTAA TACTCCTCCT CTAACATGTC AGCGTTATTG TAATGCAAGT GTGACCAATT 360 

CAGTGAAAGG AACGAATGCG ATTCTCTGGA CCTGTTTGGG ACTGAGCTTA ATAATTTCTT 42 0 

TGGCAGTTTT CGTGCTAATG TTTTTGCTAA GGAAGATAAG CTCTGAACCA TTAAAGGACG 48 0 

AGTTTAAAAA CACAGGATCA GGTCTCCTGG GCATGGCTAA CATTGACCTG GAAAAGAGCA 54 0 

GGACTGGTGA TGAAATTATT CTTCCGAGAG GCCTCGAGTA CACGGTGGAA GAATGCACCT 600 

GTGAAGACTG CATCAAGAGC AAACCGAAGG TCGACTCTGA CCATTGCTTT CCACTCCCAG 660 

CTATGGAGGA AGGCGCAACC ATTCTTGTCA CCACGAAAAC GAATGACTAT TGCAAGAGCC 720 

TGCCAGCTGC TTTGAGTGCT ACGGAGATAG AGAAATCAAT TTCTGCTAGG TAATTAACCA 780 

TTTCGACTCG AGCAGTGCCA CTTTAAAAAT CTTTTGTCAG AATAGATGAT GTGTCAGATC 840 

TCTTTAGGAT GACTGTATTT TTCAGTTGCC GATACAGCTT TTTGTCCTCT AACTGTGGAA 900 

ACTCTTTATG TTAGATATAT TTCTCTAGGT TACTGTTGGG AGCTTAATGG TAGAAACTTC 960 
CTTGGTTTCA TGATTAAAGT CTTTTTTTTT CCTGA 



SEQ ID NO: 39 Protein sequence: 
Protein Accession #: NP_001183.1 

1 11 21 31 41 51 

I I I I I I 

MLQMAGQCSQ NEYFDSLLHA CIPCQLRCSS NTPPLTCQRY CNASVTNSVK GTNAILWTCL 60 
GLSLIISLAV FVLMFLLRKI SSEPLKDEFK NTGSGLLGMA NIDLEKSRTG DEIILPRGLE 12 0 
YTVEECTCED CIKSKPKVDS DHCFPLPAME EGATILVTTK TNDYCKSLPA AliSATEIEKS 180 
ISAR 



SEQ ID NO: 40 DNA sequence 

Nucleic Acid Accession #: NM_025087.1 
Coding sequence: 183-22 82 

1 11 21 31 41 51 

I I I I I I 

ACACTGCCTC GGTTCGGCAA GTGGGTCAGT TGGCTGGGGC TCACTTGGCA ACGGGACGCG 60 

GGAACGAGGG GCGCGGACGC AGGCCCGGGA GGACGCGGCG GCGGGAACCT GGGGGCGCAG 12 0 

GGCTAGGGCA GCGGGCCCGA CCCGCACGGC TTTCCTGGAA AGCGCTGCCC CTCGCCGCGG 180 

CGATGACCTC GCTGTGGAGA GAAATCCTCT TGGAGTCGCT GCTGGGATGT GTTTCTTGGT 24 0 

CTCTCTACCA TGACCTGGGA CCGATGATCT ATTACTTTCC TTTGCAAACA CTAGAACTCA 3 00 

CTGGGCTTGA AGGTTTTAGT ATAGCATTTC TTTCTCCAAT ATTCCTAACA ATTACTCCTT 360 

TCTGGAAATT GGTTAACAAG AAGTGGATGC TAACCCTGCT GAGGATAATC ACTATTGGCA 42 0 

GCATAGCCTC CTTCCAGGCT CCAAATGCCA AACTTCGACT GATGGTTCTT GCGCTTGGGG 4 80 

TGTCTTCCTC ACTGATAGTG CAAGCTGTGA CTTGGTGGTC AGGAAGTCAT TTGCAAAGGT 54 0 

ACCTCAGAAT TTGGGGATTC ATTTTAGGAC AGATTGTTCT TGTTGTTCTA CGCATATGGT 600 

ATACTTCACT AAACCCAATC TGGAGTTATC AGATGTCCAA CAAAGTGATA CTGACATTAA 660 

GTGCCATAGC CACACTTGAT CGTATTGGCA CAGATGGTGA CTGCAGTAAA CCTGAAGAAA 72 0 

AGAAGACTGG TGAGGTAGCC ACGGGGATGG CCTCTAGACC CAACTGGCTG CTGGCAGGGG 780 

CTGCTTTTGG TAGCCTTGTG TTCCTCACCC ACTGGGTTTT TGGAGAAGTC TCTCTTGTTT 84 0 

CCAGATGGGC AGTGAGTGGG CATCCACATC CAGGGCCAGA TCCTAACCCA TTTGGAGGTG 900 

CAGTACTGCT GTGCTTGGCA AGTGGATTGA TGCTTCCATC TTGTTTGTGG TTTCGTGGTA 960 

CTGGTTTGAT CTGGTGGGTT ACAGGAACAG CTTCAGCTGC GGGGCTCCTT TACCTGCACA 102 0 

CATGGGCAGC TGCTGTGTCT GGCTGTGTCT TCGCCATCTT TACTGCATCC ATGTGGCCCC 1080 

AAACACTTGG ACACCTTATT AACTCAGGGA CAAACCCTGG GAAAACCATG ACCATTGCCA 114 0 

TGATATTTTA TCTTCTAGAA ATATTTTTCT GTGCCTGGTG CACAGCTTTT AAGTTTGTCC 12 00 

CAGGAGGTGT CTACGCTAGA GAAAGATCAG ATGTGCTTTT GGGGACAATG ATGTTAATTA 12 60 

TCGGGCTGAA TATGCTATTT GGTCCTAAGA AAAACCTTGA TTTGCTTCTT CAAACAAAAA 132 0 

ACAGTTCTAA AGTGCTTTTC AGAAAGAGTG AAAAATACAT GAAACTTTTT CTGTGGCTGC 13 80 

TTGTTGGTGT GGGATTGTTG GGATTAGGAC TACGGCATAA AGCCTATGAG AGAAAACTGG 144 0 

GCAAAGTGGC ACCAACCAAA GAGGTCTCTG CTGCCATCTG GCCTTTCAGG TTTGGATATG 1500 

ACAATGAAGG GTGGTCTAGT CTAGAAAGAT CAGCTCACCT GCTCAATGAA ACAGGTGCAG 1560 

ATTTCATAAC AATTTTGGAG AGTGATGCTT CTAAGCCCTA TATGGGGAAC AATGACTTAA 162 0 

CCATGTGGCT AGGGGAAAAG TTGGGTTTCT ATACAGACTT TGGTCCAAGC ACAAGGTATC 1680 

ACACTTGGGG GATTATGGCT TTGTCAAGAT ACCCAATTGT GAAATCTGAG CATCACCTTC 1740 

TTCCGTCACC AGAGGGCGAG ATCGCACCAG CCATCACATT GACCGTTAAC ATTTCGGGCA 1800 

AGCTGGTGGA TTTTGTCGTG ACACACTTTG GGAACCACGA AGATGACCTC GACAGGAAAC 1860 

TGCAGGCTAT TGCTGTTTCA AAACTACTGA AAAGTAGCTC TAATCAAGTG ATATTTCTGG 192 0 

GATATATCAC TTCAGCACCT GGCTCCAGAG ATTATCTACA GCTCACTGAA CATGGCAATG 1980 

TGAAGGATAT CGACAGCACT GATCATGACA GATGGTGTGA ATACATTATG TATCGAGGGC 2 040 

TGATCAGGTT GGGTTATGCA AGAATCTCCC ATGCTGAACT GAGTGATTCA GAAATTCAGA 2100 

TGGCAAAATT TAGGATCCCT GATGACCCCA CTAATTATAG AGACAACCAG AAAGTGGTCA 2160 

TAGACCACAG AGAAGTTTCT GAGAAAATTC ATTTTAATCC CAGATTTGGA TCCTACAAAG 2 22 0 

AAGGACACAA TTATGAAAAC AACCATAATT TTCATATGAA TACTCCCAAA TACTTTTTAT 22 80 

GAAACATTTA AAACAAGAAG TTATTGGCTG GGAAAATCTA AGAAAAAAAG TATGTAAGAT 2340 

AAAAAGAAGA GATTAATGAA AGTGGGAAAA TACACATGAA GAACCTCAAC TTAAAAAACA 24 00 

CATGGTATCT ATGCAGTGGG AAATTACCTC CATTTGTAAA CTATGTTGCT TAATAAAAAC 2460 
ATTTCTCTAA AAAAAAAAAA AAAAAA 



SEQ ID NO: 41 Protein sequence: 
Protein Accession #: NP 079363.1 



1 
I 

MTSLWREILL 
WKLVNKKWML 
LRIWGFILGQ 
KTGEVATGMA 
VLLCIiASGLM 
TLGHLINSGT 



11 
I 

ESLLGCVSWS 
TLLRIITIGS 
IVLWLRIWY 
SRPNWIiLAGA 
LPSCLWFRGT 
NPGKTMTIAM 



21 
I 

LYHDLGPMIY 
lASFQAPNAK 
TSLNPIWSYQ 
AFGSLVFLTH 
GLIWWVTGTA 
IFYLLEIFFC 



31 

YFPLQTLELT 
LRLMVLALGV 
MSNKVILTLS 
WVFGEVSLVS 
SAAGLLYLHT 
AWCTAFKFVP 



41 

I 

GLEGFSIAFL 
SSSLIVQAVT 
AIATLDRIGT 
RWAVSGHPHP 
WAAAVSGCVF 
GGVYARERSD 



51 
I 

SPIFLTITPF 
WWSGSHLQRY 
DGDCSKPEEK 
GPDPNPFGGA 
AIFTASMWPQ 
VliLGTMMLII 



60 
120 
180 
240 
300 
360 



GLNMLFGPKK NLDLLLQTKN SSKVLFRKSE KYMKLFLWLL VGVGLLGLGL RHKAYERKLG 42 0 

KVAPTKEVSA AIWPFRFGYD NEGWSSLERS AHLLNETGAD FITILESDAS KPYMGNNDLT 480 

MWLGEKLGFY TDFGPSTRYH TWGIMALSRY PIVKSEHHLL PSPEGEIAPA ITLTVNISGK 540 

LVDFWTHFG NHEDDLDRKL QAIAVSKLLK SSSNQVIFLG YITSAPGSRD YLQLTEHGNV 600 

KDIDSTDHDR WCEYIMYRGL IRLGYARISH AELSDSEIQM AKFRIPDDPT NYRDNQKWI 660 
DHREVSEKIH FNPRFGSYKE GHNYENNHNF HMNTPKYFL 



TABLE 25: 



Table 25 depicts SEP ID NO: Sog ID No. , UnigenelD, UnigeneTitle, Pkey, and ExAccn for 
all of the sequences in Table 26. SEP ID NO: Scq ID No links the nucleic acid and protein 
sequence information in Table 26 to Table 25. 

Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnigeneiD: Unigene number 

Unigene Title: Unigene gene title 

SEQ ID NO: Soq. l D.No.: SEQ ID NO: Soquonco I dontification Numb er found in Table 26 

Pkey ExAccn UnlgenelD Unigene TiUe SEQ ID N0:Se^4&^ 

426101 AL049987 Homo sapiens mRNA; cDNA DKFZp564F1 12 (fr 44 

419145 N99638 gb 6&7§^ 

426818 AA554827 Hs.340046 DKFZp434A01 31 protein 8&9?^ 

421057 T58283 Homo sapiens cDNA 10 9 

446619 AU076643 Hs.313 secreted phosphoprotein 1 (osteopontin. 11&1210&11 

431958 X63629 Hs.2877 cadherin 3, type 1 . P-cadherin (placenta 13&1412&13 

409041 AB033025 Hs.50081 HypotheUcal protein, XP^OSI 860 (KIAA1 19 15&1644^ 

443162 T49951 Hs.9029 DKFZP434G032 protein 17 & 18 

436385 BE551618 Hs.144097 ESTs 19-21 48-20 

447033 AI357412 Hs.157601 ESTs 22 & 23 21 & 22 

439608 AW864696 Hs.301732 hypothetical protein MGC5306 24-28 23-27^ 

449032 AA045573 Hs.22900 nuclear factor (erythrokl-derived 2Hik 29& 3028-&-29 

442577 AA292998 Hs.163900 ESTs 31 & 32 30 & 31 

429970 AK000072 Hs.227059 chloride channel, calcium activated, fam 33 & 34 32 & 33 

424566 IVI16801 Hs.1790 nuclear receptor subfamily 3, group C. m 35 & 36 34 & 35 

457407 AA505035 Hs.345911 ESTs 37 3$ 

430378 Z29572 Hs.2556 tumor necrosis factor receptor superfami 38 & 39 37-^-38 

417332 AW972717 Hs.288462 hypothetical protein FU21 511 40&413&^-40 



Table 26 



SEP ID NO : 1 Ccq ID NO ; — 3r DNA sequence 

Nucleic Acid Accession #: see Table 25 & 25A for complete list 

1 11 21 31 41 51 

I i I I I I 

CAATATAGTA CAATAACTAT TTGCATGACA TTTACATCGG ATATTATGAG TGATCTAGAG 60 

TTGATATGAA GTATATGGGA GGATGTGCAA AGGTGATGTG CAAATACTAT GTCATTTTAT 12 0 

AGGGGGGACT TGAGTATCCT TTGTTACCCT CAGGAGATCC TGAAACCAGT CCCCCATGGA 180 

TACTGAGGGC TGACTGTATA GTCCTATCCT CACGGAACTT TCATTCTAAT GGGGGAAGAC 240 

TGACTATAAA CAAAATATAT GTAATAGGTG GTGGTAAGTA CCGTGGAGAA GTAACAAATG 300 

GGGCAAAGTG AGTTATACAG CTCCATTCTT AGAAACCTTG GAGTACTTTT CTTAGTTTAT 360 

ACTCGTGGTG GTTTCCTTTT GTCTCCTTTA TTACATGGGA CTCTGACATG TGCCCATAGC 42 0 

TAGGGTGACA GTAGGATCTA CCCGATAGTA GGGTGGCAGT AGGATCTACC CAAAAAGCGT 4 80 

CCTGCTGATA CAGGACCAAA GCATCCTGTT GTTCTCGAGC CTATAAAAAG AGCTAATGGT 54 0 

GTTGCTTCTC TTAACTGTGG CCTCCTACAC TGTGTTTTGG ATGATTGGTG ATGTCTTGGA 600 

TATTCTGTTT CTTTGGAACT TTGAATATAC AACACTTTAC TAGGGTU^TTA GCAATGGAAG 660 

CAGAGCAAAG ATGTACAGAG GAAACAATGC GTAACTCTGA TGGAATTGAA GTCATGAGGC 72 0 

AGCAGAGAGC TTAAATTACA GCTTTAAAAA TTTTTATTTT TTAGAGGGAA TTTACTTGGG 78 0 

AGTAACAGCA GTAATAGTTA ACGGAGCCAG AATGCTTGAG TCATATAATT GCAAAGCAGA 84 0 

GTTGGGAGCA ACAGATGCTA AAGAGTAGTT GCTGTAGTTC CTCTTTGGGT CGTAGGAGCA 900 

GTTGTCATAT TACTATATAG CTACTGCATG AAGAAGAGTT CTTAGTGAGG CCTGGGTGAA 960 

CAGCTCTTCT TAGTATTCTG TGTGACCCCA TTTGACCTTT TAACAAATCC CTAAGTAAAT 102 0 

AAATAGCCCC TCAGGAAAAC TAAGTTTTTC TCTGCTGTTT TTTTGCTTGA GAGAGCTATA 1080 

ACTGTAATAG ACTTATATTT CTGAACATTT TAGTGCTTGC CAATATTTGG TAATATTTAT 114 0 

GTTTCCTATA TTTGTAATGA ACATTCTTCT TCCGGTACAT TTTTTGTTAA ATTATTGTTT 1200 

GATGGATAAA AGTTCACCTT TTATTGTATA AAATTGACTG AGATTAATTT ATACACATTG 12 60 

ACAATGGGTA AATAGAATTT TTCAGATTAT TAAAAGCTGA AGGATGACCA CGTAAGCAAA 132 0 

AAAAAAAAAA AAAAAACCAA CAAAAATAAA CCCAAACCCC TCAAACAATT TCGAACACGA 13 80 

AACATTCTTC TGATGCCGGC ATCCCTGCTT GCAGGTGTGA AGGGGGCAGG AATCAGCGAG 144 0 
GTGTCCTGGG CTGAGTCCCC GGGGAAGAAT ATGAT 

SEP ID NO: 2 Dcq ID NO ! 2 DNA sequence 
Nucleic Acid Accession #: X83301.1 

1 11 21 31 41 51 

I I I I I ' 

GCAAAGCCAG CTGGGCTCCT GAGTCCGGTG GGTACTTGGA GAACTTACTA CGTCTAGCTG 60 

GAGGATTGTA AATGCACCAA TCAGCATGCT GTGTCTAGCT CAAGATTTTC TCCATCCCCT 12 0 

TATTTTGGGC CAGTGGCTGT CATTACATAT GAGATGAGTC TCTTGAAGAC TACAGATGAA 180 

CTCAAGCTCC ATGAGGAGAT GTTTCATTGT CGAGAGCAGT CATGATGGCC TGCACTCCAC 24 0 

ACAATGCAAC AGAGTGAAAG AGCAGGTTCT GCTTCTTTGG TGTAGTCCTG AAGCTTCCTA 300 

AGAAACTTCA CATCAGGTGA TGGATAGGAG CAACCCTGTA AAACCAGCCT TAGACTATTT 360 

TTCAAACAGG CTGGTGAATT ACCAGATCTC CGTCAAGTGC AGTAACCAGT TCAAGTTGGA 42 0 

AGTGTGTCTT TTGAATGCAG AGAACAAAGT CGTGGACAAC CAGGCTGGGA CCCAGGGCCA 4 80 

GCTGAAGGTG CTGGGTGCCA ACCTCTGGTG GCCGTACCTG ATGCACGAAC ACCCCGCCTA 54 0 

CCTGTACTCC TGGGAGGATG GTGATTGCTC ACACCAAAGC CTTGGACCCC TCCCAGCCTG 600 

TGACCTTTGG GACCAACTCC ACCTACGCAG CAGACAAGGG GGCTCTGTAT GTGGATGTGA 660 

TCCGTGTGAA CAGCTACTAC TCTTGGTATC GCAACTACGG GCACCTGGAG TTGATTCGGC 72 0 

TGCAGCTGGC CGCCCAGTTT GAGAATTGGT GTGAGACATC ACAATCCCAT TATTCAGAGC 780 

GCGTATGGAG TGGAAACGCT TGTAGGGTTT CACCAGGGCT GGTGAATTAC CAGATCTCCG 84 0 

TCAAGTGCAG TAACCAGTTC AAGTTGGAAG TATGTCTTTT GAATGCAGAA AACAAAGTCG 900 

TGGACAACCA GGCTGGGACC CAGGGCCAGC TGAAGGTGCT GGTGCCAACC TCTGGTGGCC 960 

GTACCTGATG CACGAACACC CCGCCTACCT GTACTCGTGG GAGGATGGTG ATTGCTCACA 102 0 

CCAAAGCCTT GGACCCCTCC CAGCCTGTGA CCTTTGGGAC CAACTCCACC TACGCAGCAG 1080 

ACAAGGGGGC TCTGTATGTG GATGTGATCC GTGTGAACAG CTACTACTCT TGGTATCGCA 114 0 

ACTACGGGCA CCTGGAGTTG ATTCGGCTGC AGGCCCTGCA GCTGGCCGCC CAGTTTGTGA 1200 

ATTGGTGTAA GACATCACAA TCCCATTATT CAGAGCGCGT ATGGAGTGGA AACGCTTGTA 1260 



GGGTTTCACC AGTCTTTCCC AGGGAACTCC GATGAAGTGT TCCAACAAAA TGAGCGAGTG 132 0 

AACCAAGAAG AGGATGACAT TAGATCCAGG AGATACAACA GAGGAGATAA TCTCCAGGAT 1380 

GCCTGTGAAG AAAGATCCCT GGATCCCAGG ATGATTATAG GACAAGTTGT TCATAATCCA 144 0 

GCAGGCCAGA AGACTTCCAG GGAAACTCAT TTCAAGATGA AAATGGACCA GCCGCAGTGG 150 0 

CTCACGCCTG TAATACCAGC ACTTTGGGAG GCTGAGGCGG GCGGATCACT TGAGGTCAAG 1560 

AGTTTGAAAC TAGCCTGGCC AACGTGGCAA AACTCCATCT CTATTAAAGA TACAAAAATT 162 0 

AGCCAGGCAT AGTGGTGCAT GCCTGTAGTC CCAGCTACTT GGGATGCTGA GGCAGGAAGA 1680 

ATTGCTTGAA CCTGGGAGGC AGAGTCTGCG GTGACCGAGA TCATGCCACT GCACTCCAGC 1740 
CTGGGTGACA GAGCCAGACT CCGTCTCTAC TAAAAAAAAA AAAAAAAAAA AAA 



SEP ID NO: 3 Gcq ID NO; 3 Protein sequence: 
Protein Accession #: CAA58280.1 

1 11-21 31 41 51 

I I I I I I 

MDRSNPVKPA LDYFSNRLVN YQISVKCSNQ FKLEVCLLNA ENKWDNQAG TQGQLKVLGA 

NLWWPYLMHE HPAYLYSWED GDCSHQSLGP LPACDLWDQL HLRSRQGGSV CGCDPCEQLL 
LLVSQLRAPG VDSAAAGRPV 



SEP ID NO: 4 Ccq ID NO; 4 DNA sequence 
Nucleic Acid Accession #: BC002622.1 

1 11 21 31 41 51 

I I I i i I 

GGCACGAGGC TCCGCCCGCG GCCGGGATGC ACTAGGCAAA GCCAGCTGGG CTCCTGAGTC 60 

CGGTGGGTAC TTGGAGAACT TACTACGTCT AGCTGGAGGA TTGTAAATGC ACCAATCAGC 12 0 

ATGCTGTGTC TAGCTCAAGA TTTTCTCCAT CCCCTTATTT TGGGCCAGTG GCTGTCATTA 180 

CATATGAGAA CTCAAGCTCC ATGAGGAGAT GTTTCATTGT CGAGAGCAGT CATGATGGCC 24 0 

TGCACTCCAC ACAATGCAAC AGAGTGAAAG AGCAGGTTCT GCTTCTTTGG TGTAGTCCTG 3 00 

AAGCTTCCTA AGAAACTTCA CATCAGGTGA TGGATAGGAG CAACCCTGTA AAACCAGCCT 360 

TAGACTATTT TTCAAACAGG CTGGTGAATT ACCAGATCTC CGTCAAGTGC AGTAACCAGT 42 0 

TCAAGTTGGA AGTGTGTCTT TTGAATGCAG AAAACAAAGT CGTGGACAAC CAGGCTGGGA 4 80 

CCCAGGGCCA GCTGAAGGTG CTGGGTGCCA ACCTCTGGTG GCCGTACCTG ATGCACGAAC 54 0 

ACCCCGCCTA CCTGTACTCG TGGGAGGATG GTGATTGCTC ACACCAAAGC CTTGGACCCC 600 

TCCCAGCCTG TGACCTTTGT GACCAACTCC ACCTACGCAG CAGACAAGGG GGCTCTGTAT 660 

GTGGATGTGA TCCGTGTGAA CAGCTACTAC TCTTGGTATC GCAACTACGG GCACCTGGAG 72 0 

TTGATTCAGC TGCAGCTGGC CGCCCAGTTT GAGAATTGGT GTAAGACATC ACAATCCCAT 780 

TATTCAGAGC GCGTATGGAG TGGAAACGCT TGTAGGGTTT CACCAGTCTT TCCCAGGGAA 84 0 

CTCCGATGAA GTGTTCCAAC AAAATGAGCG AGTGAACCAA GAAGAGGATG ACATTAGATC 900 

CAGGAGATAC AACAGAGGAG ATAATCTCCA GGATGCCTGT GAAGAAAGAT CCCTGGATCC 960 

CAGGATGATT ATAGGACAAG TTGTTCATAA TCCAGCAGGC CAGAAGACTT CCAGGGAAAC 102 0 

TCATTCAAGG AGGTGAAAAT GATGGATGAC TCCTCCAAGA TGAAAATGGA CCAGCCGCAG 1080 

TGGCTCACGC CTGTAATACC AGCACTTTGG GAGGCTGAGG CAGGCGGATC ACTTGAGGTC 114 0 

AGGAGTTTGA AACTAGCCTG GCCAACGTGG CAAAACTCCA TCTCTATTAA AAATACAAAA 12 00 

ATTAGCCAAG CATAGTGGTG CATGCCTGTA GTCCCAGCTA CTTGGGATGC TGAGGCAGGA 12 60 

AGAATTGCTT GAACCTGGGA GGCAGAGTCT ACAGTGAGCC GAGATCATGC CACTGCACTC 132 0 
CAGCCTGGGC AACACAGTGA GACTCCATCT CAAAAAAAAA AAAAAAAAAA AA 



SEP ID NP:5 Ccq ID NO: 5 Protein sequence: 
Protein Accession #: AAH02622.1 

1 11 21 31 41 51 

I I I I I i 

MDRSNPVKPA LDYFSNRLVN YQISVKCSNQ FKLEVCLLNA ENKWDNQAG TQGQLKVLGA 60 
NLWWPYLMHE HPAYLYSWED GDCSHQSLGP LPACDLCDQL HLRSRQGGSV CGCDPCEQLL 12 0 
LLVSQLRAPG VDSAAAGRPV 



SEP ID NO ; 6 Ccq ID NO ; — 6- DNA sequence 

Nucleic Acid Accession #: see Table 25 & 25A for complete list 



1 11 21 31 41 51 

I I I I I I 

ACCTGAGATC AGGAGTTCGA GATCAGCCTG ACCAATAGGG TGAAACCCCG TCTCTACTAA 60 

AAATACAAAA AATTAGCTGG ACACGATGGT GGGTGCCTGT GGTCCCGGCT ACTCGGGAGG 120 

CTGAGACAGG AGAATCAGTT GACCTGGGAG TTGGTGGTTG CAGTGAGCTG AGATCACACC 180 

ATTGCATTCC AAGCCTGGGC AACAAGAGTG AAACTCCATC GCAAAAAAAA AAAAGAAGGG 240 

GCATAATTTG TGGATGAGGA TTGGATATAA GGTAAAGGAT GGGACATTCT TGGACTTACA 3 00 

GATGGTGTGA TTGCCTGGCT AGAAGAAGAA TTCCCGGTCA AAAAGAAACC ATCAGCTTTC 3 60 

CAAGTGTGAA AGAGAGATAA ATCTGTGAAG ATTATAGGGA CTACAGGAAA CTTAATCTTT 420 

TTCTTTGAAA AAGCAATTGT AGCAAAAAAA AAGAAAATTT CTTACTGTCA TCTAAAATTG 4 80 

ACATGGACAT CTTAGTGGAC TAGAAGTTAA GGGCATAAAT TCTCCCAGTG ATTTTTAATT 54 0 

TTAGCATTGT GATTAACACC TTCTAAAATT GCCAGAACTT AATAAATAAT TGCTTTTCAT 600 

TATTAGTATG CCATCAAATT TAGTAGCTGT TTCAGGCTTT AATGTGTCAA GCCTAAAATC 660 

CAGATTTTTG AGGATCTTCT CCCTCTTAAA AGAGTATTCA GTTAACTGCC GTAGAAATAC 720 

ACATGTATAC AAGGGCACTG TATACATCAG TCTAAAAAAT AAAAATATGT ATACGTTCTG 7 80 

GTGAGTCTAG CACAGCATTG CCCAATAGAA ATACCAATGG AGGTCACAAA TGTGGCCCAT 84 0 
ATAGGTTAAT TGGTAAATTT TCTNATAGNC ACC 



SEP ID NO : 7 Ccq ID NO ; — 7- DNA sequence 
Nucleic Acid Accession #: AK000942 
Coding sequence: 12 04-1503 

1 11 21 31 41 51 

I I I I I I 

GTAAAGGAAT GTCTTTTTAA TTCAGCTTTT CTTTTCTCCA TGCTAGTGTT ATCAGGTTTT 60 

GGTATTTATT TACTTACAGC ATATGTTATG AAGCTGGTTT GAAAATTGGT TTTAGATATA 12 0 

TCTGCAAGTT TACTACTTTG ACTGTAAAAA AAAAAAATGA AAAAGTAGTT GACATCTGTC 180 

CTCAGAAGAA GTTTGCAGGT TGCATATTTG TGTGTAAATA CACAGGCTAA AAGGTAATTT 240 

ATGTTCCTTG GGAATTGAAA TGGTCAGTGG CCCGTTACAG AAACTTATCA GTCATATATC 3 00 

AGCACCAGTT CATTCTTTTG CACCTTAGGG ACCATCTGTC CCCTGAGGTG ACCTGAGAAA 3 60 

CAACCAGTTG CCCACAGACT GTTATTTCTT CAAGTGAGCC AGGATTTGAT TTCACTGCCT 42 0 

TATATTCTAT TTTTAGTGTA CAGTGCTTTG ATTTTTTGGA AAAACTAAAT TTTAAACATA 4 80 

TTTGAAAAAT GTTATAAGAC TTGGACATTA AGTCTGTTGA TAGCCAAAGT CAGTTTACCA 54 0 

AAGTAAAACA AATAAATTCT ATGCTTCTTC ATTGTCAAAG AGCAGTCTGC CATCATGTGG 600 

ATATAAATGG ACTATGTAAA GTGACATGGT GCTTACTCTC TACCTAATAA TAGCCTCCCT 660 

CCTGTTCCAA CAAGATAACC AACAGGTATA TTTAATTTAC CAGTTAATAT GTTTTGGATA 72 0 

ATTGGCTGCC TTGAAATGCT ATATGTTTTA TAGTACATCA TAGCTTTAGT TTTCTTCATA 780 

AGGAAATTAC AGTTACATCC TGGCTAACAT GGTGAAACTC CATCTCTACT AAAAATACAA 840 

AAAATTAGCC GGGCGTGGTG GCGGGCACTT GTAGTCCCAG CTACTCGGGA GGCTGAGGCA 900 

GGAGAATGGC GTGAACCCAG GAGGCGGAGG TTGCAGTGAG CCGAGATCGT GCCACTGTAC 960 

TCTGGCCTGG GAGACAGAGC GAGACTCCAT CTCAAAAAAA AAAAAAAAAA AAAAAAAAGA 102 0 

GAGAGAGAGA CCTGGAGTAG AGATTCTGTC AAAGAACTTT TTCTTTCTTG AGAAGCATCT 1080 

GAAATGGAAT CTGTTGTCTC TTCGAAATAT GTACTGCTGT AACAGTGAAA CAACCCTCAG 114 0 

AGTATGCCTT CGTGTGGGCT ACTCGTTGTG GTTTTGAACT TGGGGGAACT GTCTGTGTTT 12 0 0 

GGGTCAAGAA TATGCAACTG GCTGGGCACA TTGGCTCACG CCTGTAATCC CAGCAATTTG 1260 

GGAGGCTGAG GCAGGCGGAT CACCTGAGGT CAGGGCTTCA AGACCAGACT GGCCAACATG 1320 

GTGAAACCCC GTCTCTACTG AAAATACAAA AATTAGCTGG GCATGGTGGC AGGTGCCTGT 13 80 

AATCCCAGCT ACTCGGGAGG CTGACGTGAG AGAATCGCTT GAACCCGGGA GTTGGAGGTT 144 0 

GCAGTGAGCC GAGATTGCAC CATTGCACTC CAGCTTGGGC AACAAGAGTG AAACTCTTGT 1500 
CTCAG 



SEP ID NO: 8 Ccq ID NO : 8 DNA 
Nucleic Acid Accession #: 



sequence 

see Table 25 & 25A for complete list 



1 11 21 31 41 51 

I I I I I I 

GACTAGGCTG GGCAACATAG TGAGACCTCA TCTCTAAAAT TAAAAAAATA AAAGCCACCA 60 

GAAAAAAACC TAAAAACATG CCAAGTGACA TCAGTCTTTG ATGAAAATGG CAGCAGAAGA 120 

GTGATGCCAT GGGTGGGGGT GGGAAATGCT ATTTCAGCAG AGAGGGAGCT GTCATGGAAG 180 

ACACCATGTG GCTGGGCACG GTGGCTCACA CCTGTAATCC CAGCACTTTG GGAGATAGAG 24 0 

GCAGGTGGAT CCCTTGAGCT TAGGAATTTG AGACTAGCCT GGGCAATAAG AGTGAAACTC 3 00 

CATCTCAAAA AAAAAAAAAA AAAAAGGTGC ATGAAACATA TGAAGCAAAA AGTGAAAGTC 3 60 

CCCATTCTTT TCCTTTTTCC AGAGGTGATT TTTGTGGCCA ATCTGGTTTC ATTCCCTCCC 42 0 

AGACACTTTT CTAGGCATCT ATGCGCCTCT ATTCACATAT AAACAAAATA GGAGTTTTCC 4 80 

TGTGCTTCCC TTAAATGGCA TATGTATCTT TCACTCTTTT TTTTCACCTA GTGGATCTTT 54 0 

AATACCTTAA AAGCTCAACC TGGGCTTGGT GCGGTGGCTC ATACGTGTAA TCCCAGGCCT 600 

TTGGGAGGCC AAGGTGGGAG GATCACTTGA GCTCAGGAGT TCCAGACCAT TCCAAAGCAA 660 

AAACAAAAGG ATTTTGAGAT CAGTGTGGGC AACTTAGCAA AACACCATCT CTTAAAAAAA 720 
AAAAAAAAAA 



SEP ID NO ; 9 Ccq ID NO: — 9- DNA sequence 
Nucleic Acid Accession #: BC010433.1 
Coding sequence: 3-335 

1 11 21 31 41 51 

I I I I I I 

GGTCGCCCTC CGTCGTGGTC TGGCGTGTAT TCCGAGCCTT GGTGTCTGGC GGTTTCCGAG 60 

CGTTGGTGTC TGGCGGTTTC CGAGCGTTGG TGTCTGGCGG TTTCCGACCG TTGGTGTCTG 12 0 

GCGGTTTCCG ACCGTTGGTG TCTGGCACGC GCCACCCTCT CTTGCTTTGG TTGCGCCATG 18 0 

CCGATGTACC AGACAAGAAG ACAAGAAAAT GATTTGAGGA CAGCTTCAAT CGCGGTGTGA 24 0 

AGAAGAAAGC AGCAAAACGA CCACTGAAAA CAACGCCGGT GGCAAAATAT CCAAAGAAAG 3 00 

GGTCCCAAGC GGTACATCGT CATAGCCGGA AACAGTCAGA GCCACCAGCC AATGATCTTT 3 60 

TCAATGCTGC GAAAGCTGCC AAAAGTGACA TGCAGCACCG AGAAGTCCGC GTGAAGTGCG 42 0 

TGAAGGCTCT GAAAGGGCTG TACGGTAACC GGGACCTGAC CGCACGCCTG GAGCTCTTCA 480 

CTGGCCGCTT CAAGGACTGG ATGGTTTCCA TGATCATGGA CAGAGAGTAC AGTGTGGCAG 54 0 

TGGAGGCCGT CAGATTACTG ATACTTATCC TTAAGAACAT GGAAGGGGTG CTGATGGACG 600 

TGGACTGTGA GAGCGTCTAC CCCATTGTGT AGGCCTCTAA TTGAGGCCTG GCCTCTGCTG 6 60 

TGGGTGAATT TCTGTACTGG AAACTTTTCT ACCCTGAGTG CGAGATAAGA ACGATGGGTG 72 0 

GAAGAGAGCA ACGCCAGAGC CCAGGTGCCC AGAGGACTTT CTTCCAGCTT CTGCTGTCCT 780 

TCTTTGTGGA GAGCAAGCTC CACGACCACG CTGCTTACTT AGTAGACAAC CTGTGGGACT 84 0 

GTGCAGGGAC TCAGCTGAAG GACTGGGAGG GTCTGACAAG CCTGCTGCTG GAGAAGGACC 900 

AGAGCACGTG CCACATGGAG CCAGGGCCAG GGACCTTCCA CCTCCTAGGG TGAAACCAGG 960 

AGAGATTGCT TGCTTCACTT GTACAAGGCA GGAACGGTGG CATGGGGTGG GGGAAACTTG 102 0 

GAGTTGGAAG GTGGCTAATC TTTGATTCTA TGTTTTTGAT CCTCCTGGCA CTCCAGACCT 1080 

GGGTGATGTG CAGGAGAGCA CACTGATAGA AATCCTTGTG TCCAGTGCCC AGCAACTCCT 114 0 

GCCTCAGCCT CCCGAGCAGC TGGGACTACA GGCGCCCGCC ACCACGCCTG GCTAACTTTT 12 00 

TTGTGTTTTT AGTAGAGACG GGTTTTCACC GTGTTGGCCA GGATGGTCTT GATCTCTTGA 12 60 

CCTTGTGATC CACCTGCCTC ATCATCCCAA AGTGCTGGGA TTACAGGCGT GAGCCACTGC 13 2 0 

GCCCAGCATG TTAGACAATT TTTAATTCAT CCTCTCTGTG CTGTTGTTTT CTCAGCTGTG 13 80 

AAAGGAATAT TCTGGTGGGG ACAAGGTTAC AGAGTTGCTG AGAGGGTCTC ATGACATGAA 144 0 

GGTACTGGCC TTGGCACAGT GCCTGGGGGG GCGGGGACTC CGCACATGCC TGTGATGTCA 1500 

CAGTTACTGT CAGTTCACAG CGAACCTTCC CTCCTTTTCC TGTTGACTTT CCCACACTCC 1560 

TGTAACCCTC CCTCCCTCCC TTCTTCCTCT CTCTCTCTCT CACTCACGCA CACGCACACA 162 0 

CACACACACA CACACACACA CACACACTCC ATTCACTGTC TCCATGACTC TGGAGTAAAC 1680 

TAACGTCTCG AGTTGCCATT GGAAGCCCCG TTGTCCTCAT TTAGACTTTC ATGGGTTATA 174 0 

GGCACTTTTG ACTTCCTGGG GTCCTTCTTC AGTTAAAAAA AAAAATTAGA AAATTAGGCC 18 00 

GGGCGTGGTG GCACATGCCT GTAATCCCAG CACCTTGGCC TCCCAAAGTG CTGGGATTAC 1860 

AGGAGTGAGC CACCATGCCC AGCCTCCGTT GTCCTCATTT AGACTTTCAT GGGTTATAGG 192 0 
CACTTTTGAC TTCCTGGGGT CCTTCTTCAG TTAAAAAAAA AAAAAAAAAA 



SEP ID NO; 10 Gcq ID NO; 10 DNA sequence 

Nucleic Acid Accession #: see Table 25 & 25A for complete list 



1 11 21 31 41 51 

I I I I I I 

AGTGGNTCCC CCGGNCTGCA GGAATTCGGC ACGAGATCAT GATGGCTAAT ATTTCCTGAG 60 

CACCTTTCAT TCAGGCATGA TGCCAGGTGC ACCAACTTAC TTAATCCTCA TAGCCACCAC 12 0 

CTGAGCAAGC TCCTGTTTTA TAAATGGACC AGTTCTTGTT GCTGTTGTAC AAGTTATTTT 180 

CTTTCTATAA CGTCCTCCTT GTCCTCCTTC CACATTCTTA AAGAAACTTT CCCTTCCTTT 24 0 

AAAGTACTCA GGGAGCCCTG CATTGCTTCT TGAAGCCTTC TCCAGCTTCA TCATCTCACA 3 00 

GTGGTCTCTC TTTTCACTAA ATGTCCAATA TGCTGCACAT AAGTACCCCA AAGTTAGCAC 360 

AGGAATTGTT CCATGGCTGT CATATATGTT AAAAATCATT AAAAGTTCAT TTTTTCTCTC 42 0 

ATTATGGGAA GGATACATGC TCCTACTAGT AAATTTAGTA GGTAGAAAAA AATTATCACT 480 

ATCTAGACTG CTTTCCATTT AGTCTTTATG CATAGCTTTC GTGTCTGCCT ATTTTTACCT 54 0 

TGTGTTTGTA ACTTACTATT ATAAAATATG CGTCTCTATG TTCATTGTCA ACGATTATTT 600 

ACAATAACAT GGAGTGGATT TACATGTATT CTCTATATTT GGATTAAAGG AGATAGAGTA 660 

TGTGAAATTA AATGGGAGAA GTATCTGATA CATAACAGGC AATACAAATA TTATCACATA 72 0 

GCGTCAATTT ATTTGTGAAT ATTGAAAGCT CCAAAAAAGA AAAAAAGTTT TTTTTTAATT 780 

CCCGTAATTA CTTATTGCAG TATTGTGTTC ATACAAACTG CTCAGTCATT TTGGAGAAAT 84 0 

AACAATTTTT TTCCTCATCA TGAAGTAAGG TATGCTCACT GCAAAAAAAA TCTAGAAAAT 90 0 

AAAGAGGAAC ATGCTAAAGA AAAGAATACT CCCATATAAT CTCTGTCTTC ATAAATAATC 960 

TTTTGTAACG CTTATACACT GCTGGTGGGA ATGTAAATTA GTTCAGCCAT TGTG7U\AAGT 102 0 

AGCGTAGCAA TTCCTTGAAA AACTTAAAAT AGATTTACCG TTCAACCCAG CAATCCCATT 10 80 

ATTGGGCATA TACCCAGTGG AATGTAAATC ATCCTGCCAT AAAAACACAT GCACATGTAT 114 0 

GTTCATTGCA GCACTATTCA CAATAGCAAA GACATGGAAT CAACCTATAT GCCCATCAAT 12 00 

AGTAGACTGA ATAAAGAAAA TATGGTACAT ATTCACCACA GAATACTAAG CAGCCATAAA 1260 
AAAAAA 



SEP ID NO: 11 Ccq ID NO; 11 DNA sequence 
Nucleic Acid Accession #: NM_000582.1 
Coding sequence: 88-990 

1 11 21 31 41 51 

I I I I I I 

GCAGAGCACA GCATCGTCGG GACCAGACTC GTCTCAGGCC AGTTGCAGCC TTCTCAGCCA 60 

AACGCCGACC AAGGAAAACT CACTACCATG AGAATTGCAG TGATTTGCTT TTGCCTCCTA 12 0 

GGCATCACCT GTGCCATACC AGTTAAACAG GCTGATTCTG GAAGTTCTGA GGAAAAGCAG 180 

CTTTACAACA AATACCCAGA TGCTGTGGCC ACATGGCTAA ACCCTGACCC ATCTCAGAAG 24 0 

CAGT^TCTCC TAGCCCCACA GACCCTTCCA AGTAAGTCCA ACGAAAGCCA TGACCACATG 3 00 

GATGATATGG ATGATGAAGA TGATGATGAC CATGTGGACA GCCAGGACTC CATTGACTCG 360 

AACGACTCTG ATGATGTAGA TGACACTGAT GATTCTCACC AGTCTGATGA GTCTCACCAT 42 0 

TCTGATGAAT CTGATGAACT GGTCACTGAT TTTCCCACGG ACCTGCCAGC AACCGAAGTT 48 0 

TTCACTCCAG TTGTCCCCAC AGTAGACACA TATGATGGCC GAGGTGATAG TGTGGTTTAT 54 0 

GGACTGAGGT CAAAATCTAA GAAGTTTCGC AGACCTGACA TCCAGTACCC TGATGCTACA 60 0 

GACGAGGACA TCACCTCACA CATGGAAAGC GAGGAGTTGA ATGGTGCATA CAAGGCCATC 660 

CCCGTTGCCC AGGACCTGAA CGCGCCTTCT GATTGGGACA GCCGTGGGT^ GGACAGTTAT 72 0 

GAAACGAGTC AGCTGGATGA CCAGAGTGCT GAAACCCACA GCCACAAGCA GTCCAGATTA 780 

TATAAGCGGA AAGCCAATGA TGAGAGCAAT GAGCATTCCG ATGTGATTGA TAGTCAGGAA 84 0 

CTTTCCAAAG TCAGCCGTGA ATTCCACAGC CATGAATTTC ACAGCCATGA AGATATGCTG 900 

GTTGTAGACC CCAAAAGTAA GGAAGAAGAT AAACACCTGA AATTTCGTAT TTCTCATGAA 960 

TTAGATAGTG CATCTTCTGA GGTCAATTAA AAGGAGAAAA AATACAATTT CTCACTTTGC 102 0 

ATTTAGTCAA AAGAAAAAAT GCTTTATAGC AAAATGAAAG AGAACATGAA ATGCTTCTTT 1080 

CTCAGTTTAT TGGTTGAATG TGTATCTATT TGAGTCTGGA AATAACTAAT GTGTTTGATA 114 0 

ATTAGTTTAG TTTGTGGCTT CATGGAAACT CCCTGTAAAC TAAAAGCTTC AGGGTTATGT 12 0 0 

CTATGTTCAT TCTATAGAAG AAATGCAAAC TATCACTGTA TTTTAATATT TGTTATTCTC 1260 

TCATGAATAG AAATTTATGT AGAAGCAAAC AAAATACTTT TACCCACTTA AAAAGAGAAT 132 0 

ATAACATTTT ATGTCACTAT AATCTTTTGT TTTTTAAGTT AGTGTATATT TTGTTGTGAT 13 80 

TATCTTTTTG TGGTGTGAAT AAATCTTTTA TCTTGAATGT AATAAGAATT TGGTGGTGTC 144 0 

AATTGCTTAT TTGTTTTCCC ACGGTTGTCC AGCAATTAAT AAAACATAAC CTTTTTTACT 1500 
GCCTAAAAAA AAAAAAAAAA AAAA 



SEP ID NO; 12 Ccq ID NO ; — ^ Protein sequence: 
Protein Accession #: NP 000573.1 



1 11 21 31 41 51 

I I I I I i 

MRIAVICFCL LGITCAIPVK QADSGSSEEK QLYNKYPDAV ATWLNPDPSQ KQNLLAPQTL 60 
PSKSNESHDH MDDMDDEDDD DHVDSQDSID SNDSDDVDDT DDSHQSDESH HSDESDELVT 12 0 
DFPTDLPATE VFTPWPTVD TYDGRGDSW YGLRSKSKKF RRPDIQYPDA TDEDITSHME 18 0 
SEELNGAYKA IPVAQDLNAP SDWDSRGKDS YETSQLDDQS AETHSHKQSR LYKRKANDES 24 0 
NEHSDVIDSQ ELSKVSREFH SHEFHSHEDM LWDPKSKEE DKHLKFRISH ELDSASSEVN 



SEP ID NO: 13 Ccq ID NO; — 3r9- DNA sequence 
Nucleic Acid Accession #: NM_001793 
Coding sequence: 71-2560 

1 11 21 31 41 51 

I I I I I i 

AAAGGGGCAA GAGCTGAGCG GAACACCGGC CCGCCGTCGC GGCAGCTGCT TCACCCCTCT 6 0 

CTCTGCAGCC ATGGGGCTCC CTCGTGGACC TCTCGCGTCT CTCCTCCTTC TCCAGGTTTG 12 0 

CTGGCTGCAG TGCGCGGCCT CCGAGCCGTG CCGGGCGGTC TTCAGGGAGG CTGAAGTGAC 18 0 

CTTGGAGGCG GGAGGCGCGG AGCAGGAGCC CGGCCAGGCG CTGGGGAAAG TATTCATGGG 24 0 

CTGCCCTGGG CAAGAGCCAG CTCTGTTTAG CACTGATAAT GATGACTTCA CTGTGCGGAA 3 00 

TGGCGAGACA GTCCAGGAAA GAAGGTCACT GAAGGAAAGG AATCCATTGA AGATCTTCCC 3 60 

ATCCAAACGT ATCTTACGAA GACACAAGAG AGATTGGGTG GTTGCTCCAA TATCTGTCCC 42 0 

TGAAAATGGC AAGGGTCCCT TCCCCCAGAG ACTGAATCAG CTCAAGTCTA ATAAAGATAG 480 

AGACACCAAG ATTTTCTACA GCATCACGGG GCCGGGGGCA GACAGCCCCC CTGAGGGTGT 54 0 

CTTCGCTGTA GAGAAGGAGA CAGGCTGGTT GTTGTTGAAT AAGCCACTGG ACCGGGAGGA 600 

GATTGCCAAG TATGAGCTCT TTGGCCACGC TGTGTCAGAG AATGGTGCCT CAGTGGAGGA 6 60 

CCCCATGAAC ATCTCCATCA TCGTGACCGA CCAGAATGAC CACAAGCCCA AGTTTACCCA 72 0 

GGACACCTTC CGAGGGAGTG TCTTAGAGGG AGTCCTACCA GGTACTTCTG TGATGCAGGT 780 

GACAGCCACG GATGAGGATG ATGCCATCTA CACCTACAAT GGGGTGGTTG CTTACTCCAT 84 0 

CCATAGCCAA GAACCAAAGG ACCCACACGA CCTCATGTTC ACCATTCACC GGAGCACAGG 90 0 

CACCATCAGC GTCATCTCCA GTGGCCTGGA CCGGGAAAAA GTCCCTGAGT ACACACTGAC 960 

CATCCAGGCC ACAGACATGG ATGGGGACGG CTCCACCACC ACGGCAGTGG CAGTAGTGGA 102 0 

GATCCTTGAT GCCAATGACA ATGCTCCCAT GTTTGACCCC CAGAAGTACG AGGCCCATGT 1080 

GCCTGAGAAT GCAGTGGGCC ATGAGGTGCA GAGGCTGACG GTCACTGATC TGGACGCCCC 1140 

CAACTCACCA GCGTGGCGTG CCACCTACCT TATCATGGGC GGTGACGACG GGGACCATTT 12 00 

TACCATCACC ACCCACCCTG AGAGCAACCA GGGCATCCTG ACAACCAGGA AGGGTTTGGA 12 60 

TTTTGAGGCC AAAAACCAGC ACACCCTGTA CGTTGAAGTG ACCAACGAGG CCCCTTTTGT 1320 

GCTGAAGCTC CCAACCTCCA CAGCCACCAT AGTGGTCCAC GTGGAGGATG TGAATGAGGC 13 80 

ACCTGTGTTT GTCCCACCCT CCAAAGTCGT TGAGGTCCAG GAGGGCATCC CCACTGGGGA 144 0 

GCCTGTGTGT GTCTACACTG CAGAAGACCC TGACAAGGAG AATCAAAAGA TCAGCTACCG 15 0 0 

CATCCTGAGA GACCCAGCAG GGTGGCTAGC CATGGACCCA GACAGTGGGC AGGTCACAGC 156 0 

TGTGGGCACC CTCGACCGTG AGGATGAGCA GTTTGTGAGG AACAACATCT ATGAAGTCAT 162 0 

GGTCTTGGCC ATGGACAATG GAAGCCCTCC CACCACTGGC ACGGGAACCC TTCTGCTAAC 16 8 0 

ACTGATTGAT GTCAATGACC ATGGCCCAGT CCCTGAGCCC CGTCAGATCA CCATCTGCAA 174 0 

CCAAAGCCCT GTGCGCCAGG TGCTGAACAT CACGGACAAG GACCTGTCTC CCCACACCTC 18 0 0 

CCCTTTCCAG GCCCAGCTCA CAGATGACTC AGACATCTAC TGGACGGCAG AGGTCAACGA 1860 

GGAAGGTGAC ACAGTGGTCT TGTCCCTGAA GAAGTTCCTG AAGCAGGATA CATATGACGT 192 0 

GCACCTTTCT CTGTCTGACC ATGGCAACAA AGAGCAGCTG ACGGTGATCA GGGCCACTGT 1980 

GTGCGACTGC CATGGCCATG TCGAAACCTG CCCTGGACCC TGGAAGGGAG GTTTCATCCT 204 0 

CCCTGTGCTG GGGGCTGTCC TGGCTCTGCT GTTCCTCCTG CTGGTGCTGC TTTTGTTGGT 2100 

GAGAAAGAAG CGGAAGATCA AGGAGCCCCT CCTACTCCCA GAAGATGACA CCCGTGACAA 2160 

CGTCTTCTAC TATGGCGAAG AGGGGGGTGG CGAAGAGGAC CAGGACTATG ACATCACCCA 222 0 

GCTCCACCGA GGTCTGGAGG CCAGGCCGGA GGTGGTTCTC CGCAATGACG TGGCACCAAC 22 80 

CATCATCCCG ACACCCATGT ACCGTCCTCG GCCAGCCAAC CCAGATGAAA TCGGCAACTT 234 0 

TATAATTGAG AACCTGAAGG CGGCTAACAC AGACCCCACA GCCCCGCCCT ACGACACCCT 24 00 

CTTGGTGTTC GACTATGAGG GCAGCGGCTC CGACGCCGCG TCCCTGAGCT CCCTCACCTC 2460 

CTCCGCCTCC GACCAAGACC AAGATTACGA TTATCTGAAC GAGTGGGGCA GCCGCTTCAA 252 0 

GAAGCTGGCA GACATGTACG GTGGCGGGGA GGACGACTAG GCGGCCTGCC TGCAGGGCTG 2580 



GGGACCAAAC GTCAGGCCAC AGAGCATCTC CAAGGGGTCT CAGTTCCCCC TTCAGCTGAG 2640 

GACTTCGGAG CTTGTCAGGA AGTGGCCGTA GCAACTTGGC GGAGACAGGC TATGAGTCTG 2700 

ACGTTAGAGT GGTTGCTTCC TTAGCCTTTC AGGATGGAGG AATGTGGGCA GTTTGACTTC 2 7 60 

AGCACTGAAA ACCTCTCCAC CTGGGCCAGG GTTGCCTCAG AGGCCAAGTT TCCAGAAGCC 2 820 

TCTTACCTGC CGTAAAATGC TCAACCCTGT GTCCTGGGCC TGGGCCTGCT GTGACTGACC 2 880 

TACAGTGGAC TTTCTCTCTG GAATGGAACC TTCTTAGGCC TCCTGGTGCA ACTTAATTTT 2 94 0 

TTTTTTTAAT GCTATCTTCA AAACGTTAGA GAAAGTTCTT CAAAAGTGCA GCCCAGAGCT 3 000 

GCTGGGCCCA CTGGCCGTCC TGCATTTCTG GTTTCCAGAC CCCAATGCCT CCCATTCGGA 3 060 

TGGATCTCTG CGTTTTTATA CTGAGTGTGC CTAGGTTGCC CCTTATTTTT TATTTTCCCT 3120 

GTTGCGTTGC TATAGATGAA GGGTGAGGAC AATCGTGTAT ATGTACTAGA ACTTTTTTAT 3180 
TAAAGAAACT TTTCCCAGAA AAAAA 



SEP ID NO : 14 Scq ID HQ ; — 3r4- Protein sequence: 
Protein Accession #: NP_001784.2 

1 11 21 31 41 51 

I I I I I I 

MGLPRGPLAS LLLLQVCWLQ CAASEPCRAV FREAEVTLEA GGAEQEPGQA LGKVFMGCPG 60 

QEPALFSTDN DDFTVRNGET VQERRSLKER NPLKIFPSKR ILRRHKRDWV VAPISVPENG 12 0 

KGPFPQRLNQ LKSNKDRDTK IFYSITGPGA DSPPEGVFAV EKETGWLLLN KPLDREEIAK 180 

YELFGHAVSE NGASVEDPMN ISIIVTDQND HKPKFTQDTF RGSVLEGVLP GTSVMQVTAT 24 0 

DEDDAIYTYN GWAYSIHSQ EPKDPHDLMF TIHRSTGTIS VISSGLDREK VPEYTLTIQA 3 00 

TDMDGDGSTT TAVAWEILD ANDNAPMFDP QKYEAHVPEN AVGHEVQRLT VTDLDAPNSP 3 60 

AWRATYLIMG GDDGDHFTIT THPESNQGIL TTRKGLDFEA KNQHTLYVEV TNEAPFVLKL 420 

PTSTATIWH VEDVNEAPVF VPPSKWEVQ EGIPTGEPVC VYTAEDPDKE NQKISYRILR 480 

DPAGWLAMDP DSGQVTAVGT LDREDEQFVR NNIYEVMVLA MDNGSPPTTG TGTLLLTLID 54 0 

VNDHGPVPEP RQITICNQSP VRQVLNITDK DLSPHTSPFQ AQLTDDSDIY WTAEVNEEGD 600 

TWLSLKKFL KQDTYDVHLS LSDHGNKEQL TVIRATVCDC HGHVETCPGP WKGGFILPVL 660 

GAVLALIiFIiL LVLLLLVRKK RKIKEPLLLP EDDTRDNVFY YGEEGGGEED QDYDITQLHR 720 

GLEARPEWL RNDVAPTIIP TPMYRPRPAN PDEIGNFIIE NLKAANTDPT APPYDTLLVF 780 
DYEGSGSDAA SLSSLTSSAS DQDQDYDYLN EWGSRFKKLA DMYGGGEDD 



SEP ID NO: 15 Coq ID NO i 15 DNA sequence 
Nucleic Acid Accession #: XM_051860.2 
Coding sequence: 261-4346 

1 11 21 31 41 51 

I I i I I I 

GAGCTAGCGC TCAAGCAGAG CCCAGCGCGG TGCTATCGGA CAGAGCCTGG CGAGCGCAAG 60 

CGGCGCGGGG AGCCAGCGGG GCTGAGCGCG GCCAGGGTCT GAACCCAGAT TTCCCAGACT 120 

AGCTACCACT CCGCTTGCCC ACGCCCCGGG AGCTCGCGGC GCCTGGCGGT CAGCGACCAG 180 

ACGTCCGGGG CCGCTGCGCT CCTGGCCCGC GAGGCGTGAC ACTGTCTCGG CTACAGACCC 24 0 

AGAGGGAGCA CACTGCCAGG ATGGGAGCTG CTGGGAGGCA GGACTTCCTC TTCAAGGCCA 300 

TGCTGACCAT CAGCTGGCTC ACTCTGACCT GCTTCCCTGG GGCCACATCC ACAGTGGCTG 360 

CTGGGTGCCC TGACCAGAGC CCTGAGTTGC AACCCTGGAA CCCTGGCCAT GACCAAGACC 420 

ACCATGTGCA TATCGGCCAG GGCAAGACAC TGCTGCTCAC CTCTTCTGCC ACGGTCTATT 4 80 

CCATCCACAT CTCAGAGGGA GGCAAGCTGG TCATTAAAGA CCACGACGAG CCGATTGTTT 54 0 

TGCGAACCCG GCACATCCTG ATTGACAACG GAGGAGAGCT GCATGCTGGG AGTGCCCTCT 600 

GCCCTTTCCA GGGCAATTTC ACCATCATTT TGTATGGAAG GGCTGATGAA GGTATTCAGC 660 

CGGATCGTTA CTATGGTCTG AAGTACATTG GGGTTGGTAA AGGAGGCGCT CTTGAGTTGC 720 

ATGGACAGAA AAAGCTCTCC TGGACATTTC TGAACAAGAC CCTTCACCCA GGTGGCATGG 78 0 

CAGAAGGAGG CTATTTTTTT GAAAGGAGCT GGGGCCACCG TGGAGTTATT GTTCATGTCA 84 0 

TCGACCCCAA ATCAGGCACA GTCATCCATT CTGACCGGTT TGACACCTAT AGATCCAAGA 900 

AAGAGAGTGA ACGTCTGGTC CAGTATTTGA ACGCGGTGCC CGATGGCAGG ATCCTTTCTG 960 

TTGCAGTGAA TGATGAAGGT TCTCGAAATC TGGATGACAT GGCCAGGAAG GCGATGACCA 1020 

AATTGGGAAG CAAACACTTC CTGCACCTTG GATTTAGACA CCCTTGGAGT TTTCTAACTG 108 0 

TGAAAGGAAA TCCATCATCT TCAGTGGAAG ACCATATTGA ATATCATGGA CATCGAGGCT 114 0 

CTGCTGCTGC CCGGGTATTC AAATTGTTCC AGACAGAGCA TGGCGAATAT TTCAATGTTT 1200 

CTTTGTCCAG TGAGTGGGTT CAAGACGTGG AGTGGACGGA GTGGTTCGAT CATGATAAAG 1260 



TATCTCAGAC TAAAGGTGGG GAGAAAATTT CAGACCTCTG GAAAGCTCAC CCAGGAAAAA 1320 

TATGCAATCG TCCCATTGAT ATACAGGCCA CTACAATGGA TGGAGTTAAC CTCAGCACCG 1380 

AGGTTGTCTA CAAAAAAGGC CAGGATTATA GGTTTGCTTG CTACGACCGG GGCAGAGCCT 144 0 

GCCGGAGCTA CCGTGTACGG TTCCTCTGTG GGAAGCCTGT GAGGCCCAAA CTCACAGTCA 1500 

CCATTGACAC CAATGTGAAC AGCACCATTC TGAACTTGGA GGATAATGTA CAGTCATGGA 1560 

AACCTGGAGA TACCCTGGTC ATTGCCAGTA CTGATTACTC CATGTACCAG GCAGAAGAGT 162 0 

TCCAGGTGCT TCCCTGCAGA TCCTGCGCCC CCAACCAGGT CAAAGTGGCA GGGAAACCAA 1680 

TGTACCTGCA CATCGGGGAG GAGATAGACG GCGTGGACAT GCGGGCGGAG GTTGGGCTTC 174 0 

TGAGCCGGAA CATCATAGTG ATGGGGGAGA TGGAGGACAA ATGCTACCCC TACAGAAACC 1800 

ACATCTGCAA TTTCTTTGAC TTCGATACCT TTGGGGGCCA CATCAAGTTT GCTCTGGGAT 1860 

TTAAGGCAGC ACACTTGGAG GGCACGGAGC TGAAGCATAT GGGACAGCAG CTGGTGGGTC 192 0 

AGTACCCGAT TCACTTCCAC CTGGCCGGTG ATGTAGACGA AAGGGGAGGT TATGACCCAC 1980 

CCACATACAT CAGGGACCTC TCCATCCATC ATACATTCTC TCGCTGCGTC ACAGTCCATG 204 0 

GCTCCAATGG CTTGTTGATC AAGGACGTTG TGGGCTATAA CTCTTTGGGC CACTGCTTCT 210 0 

TCACGGAAGA TGGGCCGGAG GAACGCAACA CTTTTGACCA CTGTCTTGGC CTCCTTGTCA 2160 

AGTCTGGAAC CCTCCTCCCC TCGGACCGTG ACAGCAAGAT GTGCAAGATG ATCACAGAGG 222 0 

ACTCCTACCC GGGGTACATC CCCAAGCCCA GGCAAGACTG CAATGCTGTG TCCACCTTCT 22 80 

GGATGGCCAA TCCCAACAAC AACCTCATCA ACTGTGCCGC TGCAGGATCT GAGGAAACTG 234 0 

GATTTTGGTT TATTTTTCAC CACGTACCAA CGGGCCCCTC CGTGGGAATG TACTCCCCAG 24 0 0 

GTTATTCAGA GCACATTCCA CTGGGAAAAT TCTATAACAA CCGAGCACAT TCCAACTACC 24 6 0 

GGGCTGGCAT GATCATAGAC AACGGAGTCA AAACCACCGA GGCCTCTGCC AAGGACAAGC 252 0 

GGCCGTTCCT CTCAATCATC TCTGCCAGAT ACAGCCCTCA CCAGGACGCC GACCCGCTGA 2580 

AGCCCCGGGA GCCGGCCATC ATCAGACACT TCATTGCCTA CAAGAACCAG GACCACGGGG 264 0 

CCTGGCTGCG CGGCGGGGAT GTGTGGCTGG ACAGCTGCCG GTTTGCTGAC AATGGCATTG 2 7 00 

GCCTGACCCT GGCCAGTGGT GGAACCTTCC CGTATGACGA CGGCTCCAAG CAAGAGATAA 2760 

AGAACAGCTT GTTTGTTGGC GAGAGTGGCA ACGTGGGGAC GGAAATGATG GACAATAGGA 2 82 0 

TCTGGGGCCC TGGCGGCTTG GACCATAGCG GAAGGACCCT CCCTATAGGC CAGAATTTTC 2880 

CAATTAGAGG AATTCAGTTA TATGATGGCC CCATCAACAT CCAAAACTGC ACTTTCCGAA 2 94 0 

AGTTTGTGGC CCTGGAGGGC CGGCACACCA GCGCCCTGGC CTTCCGCCTG AATAATGCCT 3000 

GGCAGAGCTG CCCCCATAAC AACGTGACCG GCATTGCCTT TGAGGACGTT CCGATTACTT 3060 

CCAGAGTGTT CTTCGGAGAG CCTGGGCCCT GGTTCAACCA GCTGGACATG GATGGGGATA 312 0 

AGACATCTGT GTTCCATGAC GTCGACGGCT CCGTGTCCGA GTACCCTGGC TCCTACCTCA 3180 

CGAAGAATGA CAACTGGCTG GTCCGGCACC CAGACTGCAT CAATGTTCCC GACTGGAGAG 324 0 

GGGCCATTTG CAGTGGGTGC TATGCACAGA TGTACATTCA AGCCTACAAG ACCAGTAACC 33 00 

TGCGAATGAA GATCATCAAG AATGACTTCC CCAGCCACCC TCTTTACCTG GAGGGGGCGC 33 60 

TCACCAGGAG CACCCATTAC CAGCAATACC AACCGGTTGT CACCCTGCAG AAGGGCTACA 342 0 

CCATCCACTG GGACCAGACG GCCCCCGCCG AACTCGCCAT CTGGCTCATC AACTTCAACA 34 80 

AGGGCGACTG GATCCGAGTG GGGCTCTGCT ACCCGCGAGG CACCACATTC TCCATCCTCT 354 0 

CGGATGTTCA CAATCGCCTG CTGAAGCAAA CGTCCAAGAC GGGCGTCTTC GTGAGGACCT 3 6 00 

TGCAGATGGA CAAAGTGGAG CAGAGCTACC CTGGCAGGAG CCACTACTAC TGGGACGAGG 3 66 0 

ACTCAGGGCT GTTGTTCCTG AAGCTGAAAG CTCAGAACGA GAGAGAGAAG TTTGCTTTCT 37 2 0 

GCTCCATGAA AGGCTGTGAG AGGATAAAGA TTAAAGCTCT GATTCCAAAG AACGCAGGCG 3 7 80 

TCAGTGACTG CACAGCCACA GCTTACCCCA AGTTCACCGA GAGGGCTGTC GTAGACGTGC 3 84 0 

CGATGCCCAA GAAGCTCTTT GGTTCTCAGC TGAAAACAAA GGACCATTTC TTGGAGGTGA 3 90 0 

AGATGGAGAG TTCCAAGCAG CACTTCTTCC ACCTCTGGAA CGACTTCGCT TACATTGAAG 3 960 

TGGATGGGAA GAAGTACCCC AGTTCGGAGG ATGGCATCCA GGTGGTGGTG ATTGACGGGA 4 02 0 

ACCAAGGGCG CGTGGTGAGC CACACGAGCT TCAGGAACTC CATTCTGCAA GGCATACCAT 4080 

GGCAGCTTTT CAACTATGTG GCGACCATCC CTGACAATTC CATAGTGCTT ATGGCATCAA 414 0 

AGGGAAGATA CGTCTCCAGA GGCCCATGGA CCAGAGTGCT GGAAAAGCTT GGGGCAGACA 42 00 

GGGGTCTCAA GTTGAAAGAG CAAATGGCAT TCGTTGGCTT CAAAGGCAGC TTCCGGCCCA 42 60 

TCTGGGTGAC ACTGGACACT GAGGATCACA AAGCCAAAAT CTTCCAAGTT GTGCCCATCC 432 0 

CTGTGGTGAA GAAGAAGAAG TTGTGAGGAC AGCTGCCGCC CGGTGCCACC TCGTGGTAGA 43 80 

CTATGACGGT GACTCTTGGC AGCAGACCAG TGGGGGATGG CTGGGTCCCC CAGCCCCTGC 444 0 

CAGCAGCTGC CTGGGAAGGC CGTGTTTCAG CCCTGATGGG CCAAGGGAAG GCTATCAGAG 45 00 

ACCCTGGTGC TGCCACCTGC CCCTACTCAA GTGTCTACCT GGAGCCCCTG GGGCGGTGCT 4560 

GGCCAATGCT GGAAACATTC ACTTTCCTGC AGCCTCTTGG GTGCTTCTCT CCTATCTGTG 4620 

CCTCTTCAGT GGGGGTTTGG GGACCATATC AGGAGACCTG GGTTGTGCTG ACAGCAAAGA 4680 

TCCACTTTGG CAGGAGCCCT GACCCAGCTA GGAGGTAGTC TGGAGGGCTG GTCATTCACA 4 74 0 

GATCCCCATG GTCTTCAGCA GACAAGTGAG GGTGGTAAAT GTAGGAGAAA GAGCCTTGGC 4 8 00 

CTTAAGGAAA TCTTTACTCC TGTAAGCAAG AGCCAACCTC ACAGGATTAG GAGCTGGGGT 4 86 0 

AGAACTGGCT ATCCTTGGGG AAGAGGCAAG CCCTGCCTCT GGCCGTGTCC ACCTTTCAGG 4 92 0 



AGACTTTGAG TGGCAGGTTT GGACTTGGAC 
GAGATTCCAG AAATCTGCTG CATTTCACAT 
CCACTGATAT CCATGATGCT GGGTGCCCCA 
TGCCTAGCTT GAGGGGTCTG CAGTCCAGTA 
ATGCCAGGTG GAGAAATCAC AGAGAGGTAA 
GCTCAGGAAG GCTTCTTGCT TACAGGAATG 
GAGGCAGCCT CTGGAATGGC TCAGGGATTC 
GTGACTACGG GGTCGCCCTT TGCTCACGTC 
GTCAGAGGGG AGCAATGGGC TTTGCTGCTT 
GCCCTGCCTC TGACTCCAAG AGGGTGAAGT 
CATTTGCTCT TCATCCAGGG AACTGAGCAC 
CGTACTCCCT CGGCCTGGGA TTTCAGAGCT 
TTCATTTTAA CAGATGGGGA AAGTGAGCCC 
GAGGGCCTGG GGAGCCCCAC CCTAGCCCTT 
CCCCAGAGTG CCCAGGCACT CCTGAGGTAG 
GGAAAGGAAA TGACTAGAGT AGAATGACAG 
GCACACAAAC CCGCCCTCCC CTTGGTGTTG 
ACCTGTCAGC CCAGCCTGGG TGCACAGTAG 
TCCTGTCTCT GCAGCTCTAC AGGTGAGGCC 
TGGTGAGCCA ATTTGGCTGA TCTTGGGTGT 
TTTCAGCTGC TGCTTAATGC CCTGCTCTCT 
GCTCCTGTAA GAGGGAGAAC TCTATCTGTG 
TCCCTGGGTC TTGTGATGAA CTACATTTAT 
CCTTCAAAGA GGGCCTGCCT GGCTCCCTCC 
GAGTCCATTC CCCAGGTGGG AGCCAACTGT 
AGCTGCTGGG AGGTGACCAT AGGGCTCTGC 
AGTCACAGGA AGGACTTCTT CCAGGGAGAT 
CCTCATGTCC TTCTTGTCCA CGGTTTTGTT 
ACACTGTGAA CCACTTAGGA TGTGATCACT 
GGCTCAGTTC ATTTAAAAAA GATATCTATT 
ACAGTACAGG ATCTGTACAT AAAAGTTTCT 
TAGGCATTTT CTTGGTAGCA CAAATTTTCT 
TTCTGTTTGT AAGACTTAAG TGAGTTAGGT 
CTTGTCTTTT TTCTGTTGCC GAAATAGCTG 
GTTTGTATGT AAACATTTCT TGTAGGCATC 
TTATTATATG TGCACTTCAA GAAGTCACTG 



TAGATGACTC TCAAAGGCCC TTTTAGTTCT 
GGTACCTGGA ACCCAACAGT TCATGGATAT 
GCGCACACGG GATGGAGAGG TGAGAACTAA 
GGGCAGGCAG TCAGGTCCAT GTGCACTGCA 
AATGGAGGCC AGTGCCATTT CAGAGGGGAG 
AAGGCTGGGG GCATTTTGCT GGGGGGAGAT 
AGCCCTCCCT GCCGCTGCCT GCTGAAGCTG 
TCTCTGGCCC ACTCATGATG GAGAAGTGTG 
ATGAGCACAG AGGAATTCAG TCCCCAGGCA 
CCACAGAAGT GAGCTCCTGC CTTAGGGCCT 
AGGGGGCCTC CAGGAGACCC TAGATGTGCT 
GGAAATATAG AAAATATCTA GCCCAAAGCC 
CCT^GATGGG AAAGAACCAC ACAGCTAAGG 
GCTGCCACAC CACATTGCCT CAACAACCGG 
CTTCTGGAAA TGGGGACAAG TCCCCTCGAA 
CTAGCAGATC TCTTCCCTCC TGCTCCCAGC 
GCGGTCCCTG TGGCCTTCAC TTTGTTCACT 
CTGCAACTCC CCATTGGTGC TACCTGGCTC 
CAGCAGAGGG AGTAGGGCTC GCCATGTTTC 
CTGAACAGCT ATTGGGTCCA CCCCAGTCCC 
CCCTGGCCCA CCTTATAGAG AGCCCAAAGA 
GTTTATAATC TTGCACGAGG CACCAGAGTC 
CCCCTTTCCT GCCCCAACCA CAAACTCTTT 
ACCCAACTGC ACCCATGAGA CTCGGTCCAA 
CAGGGAGGTC TTTCCCACCA AACATCTTTC 
TTTTAAAGAT ATGGCTGCTT CAAAGGCCAG 
TAGTGGTGAT GGAGAGGAGA GTTAAAATGA 
GAGTTTTCAC TCTTCTAATG CAAGGGTCTC 
TTCAGGTGGC CAGGAATGTT GAATGTCTTT 
TGAAAGTTCT CAGAGTTGTA CATATGTTTC 
TTCCTAAACC ATTCACCAAG AGCCAATATC 
TATTGCTTAG AAAATTGTCC TCCTTGTTAT 
CTTTAAGGAA AGCAACGCTC CTCTGAAATG 
GTCCTTTTTC GGGAGTTAGA TGTATAGAGT 
ACCATGAACA AAGATATATT TTCTATTTAT 
TCAGAGAAAT AAAGAATTGT CTTAAATGTC 



SEP ID NO: 16 Ccq ID NO; 16 Protein sequence: 
Protein Accession #: XP_051860.2 

1 11 21 31 41 51 

I I I I I I 

MGAAGRQDFL FKAMLTISWL TLTCFPGATS TVAAGCPDQS PELQPWNPGH DQDHHVHIGQ 
GKTLLLTSSA TVYSIHISEG GKLVIKDHDE PIVLRTRHIL IDNGGELHAG S/UiCPFQGNF 
TIILYGRADE GIQPDPYYGL KYIGVGKGGA LELHGQKKLS WTFLNKTLHP GGMAEGGYFF 
ERSWGHRGVI VHVIDPKSGT VIHSDRFDTY RSKKESERLV QYLNAVPDGR ILSVAVNDEG 
SRNLDDMARK AMTKLGSKHF LHLGFRHPWS FLTVKGNPSS SVEDHIEYHG HRGSAAARVF 
KLFQTEHGEY FNVSLSSEWV QDVEWTEWFD HDKVSQTKGG EKISDLWKAH PGKICNRPID 
IQATTMDGVN LSTEWYKKG QDYRFACYDR GRACRSYRVR FLCGKPVRPK LTVTIDTNVN 
STILNLEDNV QSWKPGDTLV lASTDYSMYQ AEEFQVLPCR SCAPNQVKVA GKPMYLHIGE 
EIDGVDMRAE VGLLSRNIIV MGEMEDKCYP YRNHICNFFD FDTFGGHIKF ALGFKAAHLE 
GTELKHMGQQ LVGQYPIHFH LAGDVDERGG YDPPTYIRDL SIHHTFSRCV TVHGSNGLLI 
KDWGYNSLG HCFFTEDGPE ERNTFDHCLG LLVKSGTLLP SDRDSKMCKM ITEDSYPGYI 
PKPRQDCNAV STFWMANPNN NLINCAAAGS EETGFWFIFH HVPTGPSVGM YSPGYSEHIP 
LGKFYNNRAH SNYRAGMIID NGVKTTEASA KDKRPFLSII SARYSPHQDA DPLKPREPAI 
IRHFIAYKNQ DHGAWLRGGD VWLDSCRFAD NGIGLTLASG GTFPYDDGSK QEIKNSLFVG 
ESGNVGTEMM DNRIWGPGGL DHSGRTLPIG QNFPIRGIQL YDGPINIQNC TFRKFVALEG 
RHTSAIiAFRIi NNAWQSCPHN NVTGIAFEDV PITSRVFFGE PGPWFNQLDM DGDKTSVFHD 
VDGSVSEYPG SYLTKNDNWL VRHPDCINVP DWRGAICSGC YAQMYIQAYK TSNLRMKIIK 
NDFPSHPLYL EGALTRSTHY QQYQPWTLQ KGYTIHWDQT APAELAIWLI NFNKGDWIRV 



4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
5460 
5520 
5580 
5640 
5700 
5760 
5820 
5880 
5940 
6000 
6060 
6120 
6180 
6240 
6300 
6360 
6420 
6480 
6540 
6600 
6660 
6720 
6780 
6840 
6900 
6960 
7020 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 



GLCYPRGTTF SILSDVHNRL LKQTSKTGVF VRTLQMDKVE QSYPGRSHYY WDEDSGLLFL 1140 

KLKAQNEREK FAFCSMKGCE RIKIKALIPK NAGVSDCTAT AYPKFTERAV VDVPMPKKLF 1200 

GSQLKTKDHF LEVKMESSKQ HFFHLWNDFA YIEVDGKKYP SSEDGIQVW IDGNQGRWS 1260 

HTSFRNSILQ GIPWQLFNYV ATIPDNSIVL MASKGRYVSR GPWTRVLEKL GADRGLKLKE 1320 
QMAFVGFKGS FRPIWVTIiDT EDHKAKIFQV VPIPWKKKK L 



SEP ID NO: 17 Ccq ID MO ; 17 DNA sequence 
Nucleic Acid Accession #: NM_015515.1 
Coding sequence: 61-1329 

1 11 21 31 41 51 

I I I I I I 

AGTTCTGCGG TGCCAGGGAG TGGAGCAGAG CTCAGCCCCG TCCCAAACAC AGATGGGACC 60 

ATGAACTCCG GACACAGCTT CAGCCAGACC CCCTCGGCCT CCTTCCATGG CGCCGGAGGT 12 0 

GGCTGGGGCC GGCCCAGGAG CTTCCCCAGG GCTCCCACCG TCCATGGCGG TGCGGGGGGA 180 

GCCCGCATCT CCCTGTCCTT CACCACGCGG AGCTGCCCAC CCCCTGGAGG GTCTTGGGGT 240 

TCTGGAAGAA GCAGCCCCCT ACTAGGCGGA AATGGGAAGG CCACCATGCA GAATCTCAAC 3 00 

GACCGCCTGG CCTCCTACCT GGAGAAGGTT CGCGCCCTGG AGGAGGCCAA CATGAAGCTG 360 

GAAAGCCGCA TCCTGAT^TG GCACCAGCAG AGAGATCCTG GCAGTAAGAA AGATTATTCC 420 

CAGTATGAGG AAAACATCAC ACACCTGCAG GAGCAGATAG TGGATGGTAA GATGACCAAT 4 80 

GCTCAGATTA TTCTTCTCAT TGACAATGCC AGGATGGCAG TGGATGACTT CAACCTCAAG 540 

TATGAAAATG AACACTCCTT TAAGAAAGAC TTGGAAATTG AAGTCGAGGG CCTCCGAAGG 600 

ACCTTAGACA ACCTGACCAT TGTCACAACA GACCTAGAAC AGGAGGTGGA AGGAATGAGG 660 

AAAGAGCTCA TTCTCATGAA GGAGCACCAT GAGCAGGAAA TGGAGGAGCA TCATGTGCCA 720 

AGTGACTTCA ATGTCAATGT GAAGGTGGAT ACAGGTCCCA GGGAAGATCT GATTAAGGTC 7 80 

CTGGAGGATA TGAGACAAGA ATATGAGCTT ATAATAAAGA AGAAGCATCG AGACTTGGAC 840 

ACTTGGTATA AAGAACAGTC TGCAGCCATG TCCCAGGAGG CAGCCAGTCC AGCCACTGTG 900 

CAGAGCAGAC AAGGTGACAT CCACGAACTG AAGCGCACAT TCCAGGCCCT GGAGATTGAC 960 

CTGCAGGCAC AGTACAGCAC GAAATCTGCT TTGGAAAACA TGTTATCCGA GACCCAGTCT 102 0 

CGGTACTCCT GCAAGCTCCA GGACATGCAA GAGATCATCT CCCACTATGA GGAGGAACTG 1080 

ACGCAGCTAC GCCACGAACT GGAGCGGCAG AACAATGAAT ACCAAGTGCT GCTGGGCATC 1140 

AAAACCCACC TGGAGAAGGA AATCACCACG TACCGACGGC TCCTGGAGGG AGAGAGTGAA 12 00 

GGGACACGGG AAGAATCAAA GTCGAGCATG AAAGTGTCTG CAACTCCAAA GATCAAGGCC 1260 

ATAACCCAGG AGACCATCAA CGGAAGATTA GTTCTTTGTC AAGTGAATGA AATCCAAAAG 1320 

CACGCATGAG ACCAATGAAA GTTTCCGCCT GTTGTAAAGT CTATTTTCCC CCAAGGAAAG 13 80 

TCCTTGCACA GACACCAGTG AGTGAGTTCT AAAAGATACC CTTGGAATTA TCAGACTCAG 1440 

AAACTTTTAT tTTTTTTTTT CTGTAACAGT CTCACCAGAC TTCTCATAAT GCTCTTAATA 1500 

TATTGCACTT TTCTAATCAA AGTGCGAGTT TATGAGGGTA AAGCTCTACT TTCCTACTGC 1560 
AGCCTTCAGA TTCTCATCAT TTTGCATCTA TTTTGTAGCC AATAAAACTC CGCACTAGC 



SEP ID NO: 18 Ccq ID NO ; — ^ Protein sequence: 
Protein Accession #: NP_056330.1 

1 11 21 31 41 51 

I I I I I I 

MNSGHSFSQT PSASFHGAGG GWGRPRSFPR APTVHGGAGG ARISLSFTTR SCPPPGGSWG 60 

SGRSSPLLGG NGKATMQNLN DRLASYLEKV RALEEANMKL ESRILKWHQQ RDPGSKKDYS 12 0 

QYEENITHLQ EQIVDGKMTN AQIILLIDNA RMAVDDFNLK YENEHSFKKD LEIEVEGLRR 180 

TLDNLTIVTT DLEQEVEGMR KELILMKEHH EQEMEEHHVP SDFNVNVKVD TGPREDLIKV 24 0 

LEDMRQEYEL IIKKKHRDLD TWYKEQSAAM SQEAASPATV QSRQGDIHEL KRTFQALEID 3 00 

LQAQYSTKSA LENMLSETQS RYSCKLQDMQ EIISHYEEEL TQLRHELERQ NNEYQVLLGI 3 60 

KTHLEKEITT YRRLLEGESE GTREESKSSM KVSATPKIKA ITQETINGRL VliCQVNEIQK 42 0 
HA 



SEP ID NO: 19 Ccq ID NO: IP DNA sequence 

Nucleic Acid Accession #: see Table 25 & 25A for complete li 



1 11 21 31 41 51 

TTAAAAAAAA GAGGCTTGGT AAGTTTTTGA TACTTAGTTG ACTTTTAGCA 60 

TTATCCAGCA TTTGTATTAT GAACCAGTGA GTACTGTAAT TTTTCTTTCC CTTTCAGAAA 120 

GACTCAAAGG GAACATATAA ATGTTTCCTA TTTTTNNNNN NNNNNNNNNN NNNNNNNNNN 180 

NNNNACCCAT CGTGCGATGA TCNNNNNNNN NNNNNNNNNN NNNNNTTGGG ATCCAGTTTC 240 

AAATAAGGTA TGGGAAAAAC AGATGTTTTC ATTATCGCCA CTTAATCCTT ACTTCCGATT 300 

ATAATTATAC ATGTTTGGCT GTAATAACTA TACTAAAGCA TGCTTGTGAA AGTAGACTTC 360 

TACAAGGACA GAAAACCCAC AACAACAAAG ATCGATCACG AAAGACAAGG CATATTCATT 420 

CATTAATTTA CTTCTCTTAG ACCCGGGACA TGTGGGACAA ATACTTTTGT CCTCATGGAT 4 80 

GGCTTGATAA TTTATTTATA TGTTCTAGAG TCTGAGGATT TTCTTTCAGT GGCAGACAAC 540 

AAAGGATGTT ACAATTTACT TCAAAATAAT ACAATCATGG TTTAATTTAC AGTGTAAATC 600 

CATAACTATT TTATAGAGAT GGATTATCAT ACATGGGATT ATAAAAATAA CTTACCCATA 660 

TGCTTGCAAA ATAGACTTTT CCTATTGGGA GGAACATCTT TTAACCTAAA ACGGATTTAT 720 

TTCAGATGAA TTAGACAGTA CATTTTTCAG GAGAACCAGC CTTACTGGAT GATCTTTTGT 780 

CAGGTTTGGA GGCCTCTTCT TTGTCTTTGC AACCATAACC CCTTTTCAGC TGAAGACCAC 840 
TGGCCTTCAA CCCAAGCCAG GAGTTTGGCT CAAATGA 



HBO ID NO: 20 Ccq ID MO ! 20 DNA sequence 
Nucleic Acid Accession #: D32051.1 
Coding sequence: 72-1373 

1 11 21 31 41 51 

I I I I I I 

GAATTCGAAC CAGGTGGCCA CCCGGTGTCG GTTTCATTTT CCTTTGGAAT TTCTGCTTTA 60 

CAGACAGAAC AATGGCAGCC CGAGTACTTA TAATTGGCAG TGGAGGAAGG GAACATACGC 120 
TGGCCTGGAA ACTTGCACAG TCTCATCATG TCAAACAAGT GTTGGTTGCC CCAGGAAACG 180 

CAGGCACTGC CTGCTCTGAA AAGATTTCAA ATACCGCCAT CTCAATCAGT GACCACACTG 240 
CCCTTGCTCA ATTCTGCAAA GAGAAGAAAA TTGAATTTGT AGTTGTTGGA CCAGAAGCAC 3 00 
CTCTGGCTGC TGGGATTGTT GGGAACCTGA GGTCTGCAGG AGTGCAATGC TTTGGCCCAA 360 
CAGCAGAAGC GGCTCAGTTA GAGTCCAGCA AAAGGTTTGC CAAAGAGTTT ATGGACAGAC 420 
ATGGAATCCC AACCGCACAA TGGAAGGCTT TCACCAAACC TGAAGAAGCC TGCAGCTTCA 480 
TTTTGAGTGC AGACTTCCCT GCTTTGGTTG TGAAGGCCAG TGGTCTTGCA GCTGGAAAAG 540 
GGGTGATTGT TGCAAAGAGC AAAGAAGAGG CCTGCAAAGC TGTACAAGAG ATCATGCAGG 600 
AGAAAGCCTT TGGGGCAGCT GGAGAAACAA TTGTCATTGA AGAACTTCTT GACGGAGAAG 660 
AGGTGTCGTG TCTGTGTTTC ACTGATGGCA AGACTGTGGC CCCCATGCCC CCAGCACAGG 720 
ACCATAAGCG ATTACTGGAG GGAGATGGTG GCCCTAACAC AGGGGGAATG GGAGCCTATT 780 
GTCCAGCCCC TCAGGTTTCT AATGATCTAT TACTAAAAAT TAAAGATACT GTTCTTCAGA 840 
GGACAGTGGA TGGCATGCAG CAAGAGGGTA CTCCATATAC AGGTATTCTC TATGCTGGAA 900 
TAATGCTGAC CAAGAATGGC CCAAAAGTTC TAGAGTTTAA TTGCCGTTTT GGTGATCCAG 960 

AGTGCCAAGT AATCCTCCCA CTTCTTAAAA GTGATCTTTA TGAAGTGATT CAGTCCACCT 1020 

TAGATGGACT GCTCTGCACA TCTCTGCCTG TTTGGCTAGA AAACCACACC GCCCTAACTG 1080 

TTGTCATGGC AAGTAAAGGT TATCCTGGAG ACTACACCAA GGGTGTAGAG ATAACAGGGT 1140 

TTCCTGAGGC TCAAGCTCTA GGACTGGAGG TGTCCCATGC AGGCACTGCC CTCAAAAATG 12 00 

GCAAAGTAGT AACTCATGGG GGTAGAGTTC TTGCAGTCAC AGCCATCCGG GAAAATCTCA 1260 

TATCAGCCCT TGAGGAAGCC AAGAAAGGAC TAGCTGCTAT AAAGTTTGAG GGAGCAATTT 1320 

ATAGGAAAGA CATCGGCTTT CGTGCCATAG CTTTCCTCCA GCAGCCCAGG TAAAACTCTA 1380 

AGCAAGTTAG CTGTAGTGCC ATTTCAGAAA CTGGCCTAAA TGGCTATGTA GAACATTCCA 144 0 

TTAACCCTAT AAGTCATTCA GTATTCTTTT CTCTCTGTGG GAGTGATACA GTCTTGGTTT 1500 

GTATTTTGTT TGAATCAAAA CTGGTTATAG CAATACTCAA ATGGAAAAAA CTTCATGATA 1560 

GCGTAAGTTT GGAAAGTTTA GCAAAATCAC AGTGGTACTG ATTTTTATTT GTTTTCTATT 1620 

^^^^^^^^^^ TATATTTTTA ATTTTTTTAA CAGGGTCTTC CTCTCTCGCC CAAGTTCTCA 1680 

TGCCTCAGCC TCCCAAATAG CTGGGACTAC AGGCACAGGC CACCACACCT GGCTAATTTT 174 0 

TTTGTATTTT TTGTGGAGAT GGGGTTCACC ATGTTGCCAA GGCCAGTCTG AAAGCCTGGG 1800 

CTCAAGTGAT CCTCCTGCTT TGGCCTCCCA AAATGCTGGG ACTATAGGCA TGAGGCGCTG 1860 

CACTTGGCCT GATACTGATT TTTATTCCTT GCGTTATCAC ATAGTGTTGT ATTTGAAACA 1920 

TAGTTCATGG TTTTATCAAA GAACTGAAGA TGAGAATACT GGTCATCTAA CTTTGTAATT 1980 



TGATTTGATT ATACTGTAAA GTTTGACAGT CCCATTTTAA CCTGCGTTTG TATCTATTAC 204 0 
TAAAATGTAT TTTTTGACCT CTTACTGATT CATGGTTGGT ATGTACAAAC TGTTGACTTG 2100 
TAAAATCAAT AAAGTCTTAG TTGG 



SEP ID NO: 21 Ccq ID NO! 21 Protein sequence: 
Protein Accession #: BAA06809.1 



MAARVLIIGS 
FCKEKKIEFV 
TAQWKAFTKP 
GAAGETIVIE 
QVSNDLLLKI 
ILPLLKSDLY 
QAIiGLEVSHA 
IGFRAIAFLQ 



11 
I 

GGREHTLAWK 
WGPEAPLAA 
EEACSFILSA 
ELLDGEEVSC 
KDTVLQRTVD 
EVIQSTLDGL 
GTALKNGKW 
QPR 



21 
I 

LAQSHHVKQV 
GIVGNLRSAG 
DFPALWKAS 
LCFTDGKTVA 
GMQQEGTPYT 
LCTSLPVWLE 
THGGRVLAVT 



31 
I 

LVAPGNAGTA 
VQCFGPTAEA 
GLAAGKGVIV 
PMPPAQDHKR 
GILYAGIMLT 
NHTALTWMA 
AIRENLISAL 



41 
I 

CSEKISNTAI 
AQLESSKRFA 
AKSKEEACKA 
LLEGDGGPNT 
KNGPKVLEFN 
SKGYPGDYTK 
EEAKKGLAAI 



51 
I 

SISDHTALAQ 
KEFMDRHGIP 
VQEIMQEKAF 
GGMGAYCPAP 
CRFGDPECQV 
GVEITGFPEA 
KFEGAIYRKD 



60 
120 
180 
240 
300 
360 
420 



SEP ID NO: 22 Ccq ID NO; 22 DNA sequence 
Nucleic Acid Accession #: EOS cloned 

Coding sequence: 1-2424 

1 11 21 31 41 51 

I I I I I I 

ATGCCCCCTT TCCTGTTGCT GGAGGCCGTC TGTGTTTTCC TGTTTTCCAG AGTGCCCCCA 60 

TCTCTCCCTC TCCAGGAAGT CCATGTAAGC AAAGAAACCA TCGGGAAGAT TTCAGCTGCC 120 

AGCAAAATGA TGTGGTGCTC GGCTGCAGTG GACATCATGT TTCTGTTAGA TGGGTCTAAC 180 

AGCGTCGGGA AAGGGAGCTT TGAAAGGTCC AAGCACTTTG CCATCACAGT CTGTGACGGT 24 0 

CTGGACATCA GCCCCGAGAG GGTCAGAGTG GGAGCATTCC AGTTCAGTTC CACTCCTCAT 300 

CTGGAATTCC CCTTGGATTC ATTTTCAACC CAACAGGAAG TGAAGGCAAG AATC7\AGAGG 360 

ATGGTTTTCA AAGGAGGGCG CACGGAGACG GAACTTGCTC TGAAATACCT TCTGCACAGA 420 

GGGTTGCCTG GAGGCAGAAA TGCTTCTGTG CCCCAGATCC TCATCATCGT CACTGATGGG 480 

AAGTCCCAGG GGGATGTGGC ACTGCCATCC AAGCAGCTGA AGGAAAGGGG TGTCACTGTG 540 

TTTGCTGTGG GGGTCAGGTT TCCCAGGTGG GAGGAGCTGC ATGCACTGGC CAGCGAGCCT 600 

AGAGGGCAGC ACGTGCTGTT GGCTGAGCAG GTGGAGGATG CCACCAACGG CCTCTTCAGC 660 

ACCCTCAGCA GCTCGGCCAT CTGCTCCAGC GCCACGCCAG ACTGCAGGGT CGAGGCTCAC 72 0 

CCCTGTGAGC ACAGGACGCT GGAGATGGTC CGGGAGTTCG CTGGCAATGC CCCATGCTGG 7 80 

AGAGGATCGC GGCGGACCCT TGCGGTGCTG GCTGCACACT GTCCCTTCTA CAGCTGGAAG 84 0 

AGAGTGTTCC TAACCCACCC TGCCACCTGC TACAGGACCA CCTGCCCAGG CCCCTGTGAC 900 

TCGCAGCCCT GCCAGAATGG AGGCACATGT GTTCCAGAAG GACTGGACGG CTACCAGTGC 960 

CTCTGCCCGC TGGCCTTTGG AGGGGAGGCT AACTGTGCCC TGAAGCTGAG CCTGGAATGC 102 0 

AGGGTCGACC TCCTCTTCCT GCTGGACAGC TCTGCGGGCA CCACTCTGGA CGGCTTCCTG 1080 

CGGGCCAAAG TCTTCGTGAA GCGGTTTGTG CGGGCCGTGC TGAGCGAGGA CTCTCGGGCC 1140 

CGAGTGGGTG TGGCCACATA CAGCAGGGAG CTGCTGGTGG CGGTGCCTGT GGGGGAGTAC 1200 

CAGGATGTGC CTGACCTGGT CTGGAGCCTC GATGGCATTC CCTTCCGTGG TGGCCCCACC 1260 

CTGACGGGCA GTGCCTTGCG GCAGGCGGCA GAGCGTGGCT TCGGGAGCGC CACCAGGACA 132 0 

GGCCAGGACC GGCCACGTAG AGTGGTGGTT TTGCTCACTG AGTCACACTC CGAGGATGAG 13 80 

GTTGCGGGCC CAGCGCGTCA CGCAAGGGCG CGAGAGCTGC TCCTGCTGGG TGTAGGCAGT 1440 

GAGGCCGTGC GGGCAGAGCT GGAGGAGATC ACAGGCAGCC CAAAGCATGT GATGGTCTAC 1500 

TCGGATCCTC AGGATCTGTT CAACCAAATC CCTGAGCTGC AGGGGAAGCT GTGCAGCCGG 1560 

CAGCGGCCAG GGTGCCGGAC ACAAGCCCTG GACCTCGTCT TCATGTTGGA CACCTCTGCC 1620 

TCAGTAGGGC CCGAGAATTT TGCTCAGATG CAGAGCTTTG TGAGAAGCTG TGCCCTCCAG 1680 

TTTGAGGTGA ACCCTGACGT GACACAGGTC GGCCTGGTGG TGTATGGCAG CCAGGTGCAG 1740 

ACTGCCTTCG GGCTGGACAC CAAACCCACC CGGGCTGCGA TGCTGCGGGC CATTAGCCAG 1800 

GCCCCCTACC TAGGTGGGGT GGGCTCAGCC GGCACCGCCC TGCTGCACAT CTATGACAAA 1860 

GTGATGACCG TCCAGAGGGG TGCCCGGCCT GGTGTCCCCA AAGCTGTGGT GGTGCTCACA 192 0 

GGCGGGAGAG GCGCAGAGGA TGCAGCCGTT CCTGCCCAGA AGCTGAGGAA CAATGGCATC 1980 

TCTGTCTTGG TCGTGGGCGT GGGGCCTGTC CTAAGTGAGG GTCTGCGGAG GCTTGCAGGT 2040 

CCCCGGGATT CCCTGATCCA CGTGGCAGCT TACGCCGACC TGCGGTACCA CCAGGACGTG 2100 



CTCATTGAGT 
TGCATGAATG 
GGCTGGGAGG 
CAGGGATGGA 
CGTACCCCTC 
AATGTCTGTG 



GGCTGTGTGG 
AGGGCAGCTG 
GCCCCCACTG 
TTCTTGAGAC 
CCAGCAACTA 
CCCCAGGTCC 



AGAAGCCAAG 
CGTCCTGCAG 
CGAGAACCGT 
GCCCCTGAGG 
CAGAGAAGGC 
TTAG 



CAGCCAGTCA ACCTCTGCAA ACCCAGCCCG 2160 

AATGGGAGCT ACCGCTGCAA GTGTCGGGAT 222 0 

GAGTGGAGCT CTTGCTCTGT ATGTGTGAGC 22 80 

CACATGGCTC CCGTGCAGGA GGGCAGCAGC 234 0 

CTGGGCACTG AAATGGTGCC TACCTTCTGG 24 00 



SEP ID NO: 23 Ccq ID NO ; 23 Protein sequence: 
Protein Accession #: EOS cloned 



I 

MPPFLLLEAV 
SVGKGSFERS 
MVFKGGRTET 
FAVGVRFPRW 
PCEHRTLEMV 
SQPCQNGGTC 
RAKVFVKRFV 
LTGSALRQAA 
EAVRAELEEI 
SVGPENFAQM 
APYLGGVGSA 
SVLWGVGPV 
CMNEGSCVLQ 
RTPPSNYREG 



11 

I 

CVFLFSRVPP 
KHFAITVCDG 
ELALKYLLHR 
EELHALASEP 
REFAGNAPCW 
VPEGLDGYQC 
RAVLSEDSRA 
ERGFGSATRT 
TGSPKHVMVY 
QSFVRSCALQ 
GTALLHIYDK 
LSEGLRRIiAG 
NGSYRCKCRD 
liGTEMVPTFW 



21 

I 

SLPLQEVHVS 
LDISPERVRV 
GLPGGRNASV 
RGQHVLLAEQ 
RGSRRTLAVL 
LCPLAFGGEA 
RVGVATYSRE 
GQDRPRRVW 
SDPQDLFNQI 
FEVNPDVTQV 
VMTVQRGARP 
PRDSLIHVAA 
GWEGPHCENR 
NVCAPGP 



31 

I 

KETIGKISAA 
GAFQFSSTPH 
PQILIIVTDG 
VEDATNGLFS 
AAHCPFYSWK 
NCALKLSLEC 
LLVAVPVGEY 
LLTESHSEDE 
PELQGKLCSR 
GLWYGSQVQ 
GVPKAVWLT 
YADLRYHQDV 
EWSSCSVCVS 



41 

I 

SKMMWCSAAV 
LEFPLDSFST 
KSQGDVALPS 
TLSSSAICSS 
RVFLTHPATC 
RVDLLFLLDS 
QDVPDLVWSL 
VAGPARHARA 
QRPGCRTQAL 
TAFGLDTKPT 
GGRGAEDAAV 
LIEWLCGEAK 
QGWILETPLR 



51 

I 

DIMFLLDGSN 
QQEVKARIKR 
KQLKERGVTV 
ATPDCRVEAH 
YRTTCPGPCD 
SAGTTLDGFL 
DGIPFRGGPT 
RELLLLGVGS 
DLVFMLDTSA 
RAAMLRAISQ 
PAQKLRNNGI 
QPVNLCKPSP 
HMAPVQEGSS 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 



SEO ID NO: 24 Ccq ID NO i 21 DNA sequence 

Nucleic Acid Accession #: see Table 25 & 25A for complete list 



AGGTCGGCTG 
CTAGGACTCA 
AAGTGCGCGG 
TTGATGGATA 
GCAAATCGAA 
TCACCTAAAG 
CACGCAAGTA 
TTTTTGAAAG 
AGCCAACAGG 
TAGATAAGAA 
ATGAAAAAAA 
TTTTTAACTA 
TGAATGTTGG 
ATGATGATGG 
TCTGGAGTTA 
GGAATGAATG 
GTGTCAAATT 
ATTGGTTGAA 
ATCCATATTA 
TCAGTTTCAT 
ATCTAAAGTG 
ATTTATTGAC 
ACTATTTCTA 
AAGGAATAAA 
ACAATCTGTT 
CGTTGTGGCG 
TGGTGTCTCC 



11 
I 

GTTATCGGGA 
CGCACCAGGC 
TTGGGGCCAC 
AATCAGGAAT 
GTGATAACTC 
GGGAGAAAAG 
TTCATCAAGT 
ATTCAAGAAC 
AAGACCACGG 
GAAACAATTT 
CGCACCTTGG 
TATTGAAAAA 
TGAAGATTTA 
ATCCATTTCT 
AAGGTATTGG 
TGATTTGGAC 
GAGAACCAGG 
GAAATTAGAT 
AAACGAGATG 
TAAAAAGGGC 
ACCTTGATGG 
TTTAAATAAT 
GAAAGTCAGC 
AATTTGAGGT 
GTGCGGCGCC 
GCCCGGGGGC 
CCAAGAGAGA 



21 
I 

GTTGGAGGGC 
CAGTCGCGGG 
TGCGAGGCCG 
AGATTCTCTT 
TTCTGATAGC 
T^AACCCCATT 
GACTCATCTT 
AGGAAAAAGA 
GGAAGACCAG 
AGAAGCAGAG 
AGAAAAATTT 
CTGAAGTATG 
GAAAATGAAG 
CCTATTGAGG 
CATACTCCAC 
TCATTCATGT 
TAGATCCCCA 
CCCAAAGATT 
ACAGAAGCCA 
AAACTTGAAG 
ACAGTGGAAG 
TTTGTCTAAT 
ATGTATTTTT 
TTCAATACAA 
CCTGGGCCCC 
CCCACAGTTG 
GAACACCTCC 



31 
I 

TGAGGTCGGG 
TTTTGGGCCG 
TTTTAGAAAA 
GACCATGTGA 
AGCTTATTTA 
CGAAAATTTG 
TTGAACCAGT 
GATATAAAAA 
AAGGAAGGAG 
GATCTGGCTT 
TAACGTTTGA 
AACACCACCT 
ATTTTGACAG 
AGTCAACGCT 
ACATCTGTAC 
ATGAGAGTAA 
CCACCTACAG 
CTTGGTGAAT 
AAGTAATTAT 
ATAAATCTTT 
AAATCACAAC 
GCTACATATA 
GGCTCGAAGT 
AAACAAAACA 
TTGAGAGAAA 
GGTTTAGGTG 
GGGGTCAAGC 



41 

I 

AGGGTGGTGT 
AGGCCTGGGT 
CTGTTTAAAA 
CATCTGATGC 
AAACTCAGTG 
TTCGTACACC 
ACCATTGACT 
AAAGAAAAAG 
AAATCCTATA 
CCCATTTTTA 
GCAAGCTGTT 
GAAAGAATCA 
TCGTAGATAC 
TTTATCTTGA 
CATTCTTGAG 
GCAATGCTTT 
TAAAAAGGAC 
TTTGAAGTCT 
GGGCTGACAG 
TGACTCCAGC 
ATGGAATTCC 
CACAATTAAA 
TTCTCTAGTG 
AACAACACGA 
ACTTTTTAGA 
GGCACCCTTG 
GGACAACAAG 



51 
I 

GTACAGAGCT 
TACAAGCAGC 
CAAAGAGCAA 
TGTGGAACTT 
TATCCCTTAC 
TGAAAGTGTT 
ATAAAAGCTA 
AGGAGGTACC 
TACTCACTAA 
GAATCAGAGA 
GCAAGAGGAT 
TTGAAGCAAA 
AAATTTTTGG 
GGACATGGTG 
TGATCGCTTA 
TTTTTCCAGG 
CCTAAAGTAA 
TCATCAGTAT 
GACAACTGGA 
TCTTTAGAGG 
TCGAATAACA 
AAACCTTTAC 
TTTTCTGTGG 
AACACGAAAA 
ACCCCTTTTG 
TGTCTACAAG 
AGTGCGTCGT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 



GAGGACTCTT CACCCAAAGT ATATAAAACC CGCCCCGCGG GGGAACCACC GGCCGCTTTT 
CTGTAGACAC AACCCCCACA GTGGGAACCT CTGAGGGCGC ACACACAGGG CGAGCCTTAT 
CAACAAGGGG TGCCCAACAG AAACCCCGAG TTAAAAATCG 



1680 
1740 



SEP ID NO: 25 Ccq ID NO ; 25 DNA sequence 
Nucleic Acid Accession #: BC001972.1 
Coding sequence: 183-1019 

1 11 21 31 41 51 

I I i I I I 

GGTCGGCTGG TTATCGGGAG TTGGAGGGCT GAGGTCGGGA GGGTGGTGTG TACAGAGCTC 60 

TAGGACTCAC GCACCAGGCC AGTCGCGGGT TTTGGGCCGA GGCCTGGGTT ACAAGCAGCA 12 0 

AGTGCGCGGT TGGGGCCACT GCGAGGCCGT TTTAGAAAAC TGTTTAAAAC AAAGAGCAAT 180 

TGATGGATAA ATCAGGAATA GATTCTCTTG ACCATGTGAC ATCTGATGCT GTGGAACTTG 24 0 

CAAATCGAAG TGATAACTCT TCTGATAGCA GCTTATTTAA AACTCAGTGT ATCCCTTACT 300 

CACCTAAAGG GGAGAAAAGA AACCCCATTC GAAAATTTGT TCGTACACCT GAAAGTGTTC 360 

ACGCAAGTGA TTCATCAAGT GACTCATCTT TTGAACCAAT ACCATTGACT ATAAAAGCTA 420 

TTTTTGAAAG ATTCAAGAAC AGGAAAAAGA GATATAAAAA AAAGAAAAAG AGGAGGTACC 4 80 

AGCCAACAGG AAGACCACGG GGT^GACCAG AAGGAAGGAG AAATCCTATA TACTCACTAA 54 0 

TAGATAAGAA GAAACAATTT AGAAGCAGAG GATCTGGCTT CCCATTTTTA GAATCAGAGA 600 

ATGAAAAAAA CGCACCTTGG AGAAAAATTT TAACGTTTGA GCAAGCTGTT GCAAGAGGAT 660 

TTTTTAACTA TATTGAAAAA CTGAAGTATG AACACCACCT GAAAGAATCA TTGAAGCAAA 72 0 

TGAATGTTGG TGAAGATTTA GAAAATGAAG ATTTTGACAG TCGTAGATAC AAATTTTTGG 78 0 

ATGATGATGG ATCCATTTCT CCTATTGAGG AGTCAACAGC AGAGGATGAG GATGCAACAC 840 

ATCTTGAAGA TAACGAATGT GATATCAAAT TGGCAGGGGA TAGTTTCATA GTAAGTTCTG 900 

AATTCCCTGT AAGACTGAGT GTATACTTAG AAGAAGAGGA TATTACTGAA GAAGCTGCTT 960 

TGTCTAAAAA GAGAGCTACA AAAGCCAAAA ATACTGGACA GAGAGGCCTG AAAATGTGAC 102 0 

AGGATCATGA ATGTCAAAGG CTTTTATCTT GAGAACATGG TGTCTGGAGT TAAAGGACTA 108 0 

TTGTTAGATC TGTGGGAAGG AATTACAAGA CAGTTGCTAA AAGTTTGAAA AAGACGGTTG 114 0 

CTAAACGTTA TGAAAAACCA GATAATCTAC TTTTTTACCT TAGGTATTGG CATACTCCAC 12 00 

ACATCTGTAC CATTCTTGAG TGATCGCTTA GGAATGAATG TGATTTGAAC TCATTCATGT 1260 

TGAGAGGGTG TCAAATTGAG AACCAGGTAG ATCCCCACCA CCTACAGTAA AAAGGACCCT 132 0 

AAAGTAAATT GGTTGAAGAA ATTAGATCCC AAAGATTCTT GGTGAATTTT GAAGTCTTCA 13 8 0 

TCAGTATATC CATATTAAAA CGAGATGACA GAAGCCAAAG TAATTATGGG CTGACAGGAC 144 0 

AACTGGATCA GTTTCATTAA AAAGGGCAAA CTTGAAGATA AATCTTTTGA CTCCAGCTCT 1500 

TTAGAGGATC TAAAGTGACC TTGATGGACA GTGGAAGAAA TCACAACATG GAATTCCTCG 1560 

AATAACAATT TATTGACTTT AAATAATTTT GTCTAATGCT ACATATACAC AATTAAAAAA 162 0 
CCTTTACACT AAAAAAA7VAA AAAAAA 



SEP ID NO: 26 Scq ID NO ! — Protein sequence: 
Protein Accession #: AAH01972.1 



1 11 21 31 41 51 

I I I I I I 

MDKSGIDSIiD HVTSDAVELA NRSDNSSDSS LFKTQCIPYS PKGEKRNPIR KFVRTPESVH 60 
ASDSSSDSSF EPIPLTIKAI FERFKNRKKR YKKKKKRRYQ PTGRPRGRPE GRRNPIYSLI 12 0 
DKKKQFRSRG SGFPFLESEN EKNAPWRKIL TFEQAVARGF FNYIEKLKYE HHLKESLKQM 180 
NVGEDLENED FDSRRYKFLD DDGSISPIEE STAEDEDATH LEDNECDIKL AGDSFIVSSE 24 0 
FPVRLSVYLE EEDITEEAAL SKKRATKAKN TGQRGLKM 



SEP ID NP:27 Ccq ID NO; 27 DNA sequence 
Nucleic Acid Accession #: AK027016 
Coding sequence: 207-1043 

1 11 21 31 41 51 

I I I I I I 

CTTTTCTTCC GCACGGTTGG AGGAGGTCGG CTGGTTATCG GGAGTTGGAG GGCTGAGGTC 60 
GGGAGGGTGG TGTGTACAGA GCTCTAGGAC TCACGCACCA GGCCAGTCGC GGATTTTGGG 12 0 



CCGAGGCCTG GGTTACAAGC AGCAAGTGCG CGGTTGGGGC CACTGCGAGG CCGTTTTAGA 180 

AAACTGTTTA AAACAAAGAG CAATTGATGG ATAAATCAGG AATAGATTCT CTTGACCATG 24 0 

TGACATCTGA TGCTGTGGAA CTTGCAAATC GAAGTGATAA CTCTTCTGAT AGCAGCTTAT 3 00 

TTAAAACTCA GTGTATCCCT TACTCACCTA AAGGGGAGAA AAGAAACCCC ATTCGAAAAT 3 60 

TTGTTCGTAC ACCTGAAAGT GTTCACGCAA GTGATTCATC AAGTGACTCA TCTTTTGAAC 42 0 

CAATACCATT GACTATAAAA GCTATTTTTG AAAGATTCAA GAACAGGAAA AAGAGATATA 4 80 

AAAAAAAGAA AAAGAGGAGG TACCAGCCAA CAGGAAGACC ACGGGGAAGA CCAGAAGGAA 54 0 

GGAGAAATCC TATATACTCA CTAATAGATA AGAAGAAACA ATTTAGAAGC AGAGGATCTG 600 

GCTTCCCATT TTTAGAATCA GAGAATGAAA AAAACGCACC TTGGAGAAAA ATTTTAACGT 6 60 

TTGAGCAAGC TGTTGCAAGA GGATTTTTTA ACTATATTGA AAAGCTGAAG TATGAACACC 72 0 

ACCTGAAAGA ATCATTGAAG CAAATGAATG TTGGTGAAGA TTTAGAAAAT GAAGATTTTG 780 

ACAGTCGTAG ATACAAATTT TTGGATGATG ATGGATCCAT TTCTCCTATT GAGGAGTCAA 840 

CAGCAGAGGA TGAGGATGCA ACACATCTTG AAGATAACGA ATGTGATATC AAATTGGCAG 900 

GGGATAGTTT CATAGTAAGT TCTGAATTCC CTGTAAGACT GAGTGTATAC TTAGAAGAAG 960 

AGGATATTAC TGAAGAAGCT GCTTTGTCTA AAAAGAGAGC TACAAAAGCC AAAAATACTG 102 0 

GACAGAGAGG CCTGAAAATG TGACAGGATC ATGAATGTCA AAGGCTTTTA TCTTGAGAAC 1080 

ATGGTGTCTG GAGTTAAAGG TATTGGCATA CTCCACACAT CTGTACCATT CTTGAGTGAT 114 0 

CGCTTAGGAA TGAATGTGAT TTGAACTCAT TCATGTTGAG AGGGTGTCAA ATTGAGAACC 12 00 

AGGTAGATCC CCACCACCTA CAGTAAAAAG GACCCTAAAG TAAATTGGTT GAAGAAATTA 12 60 

GATCCCAAAG ATTCTTGGTG AATTTTGAAG TCTTCATCAG TATATCCATA TTAAAACGAG 13 2 0 

ATGACAGAAG CCAAAGTAAT TATGGCAAGT AATGGTTTTT ATCTTAACTA TAAGTTATTT 1380 

GCTCAAGGGT GTAATGGTCA TTACCAAGGC TTTTAGAATG CAGTTTCTCA TTTGCTGTGG 144 0 

ACATGACCAT AAAAAAAAAT TTCCCAGTAG GTTTTCTATC TGCTACGTTG CTAGCAATCA 15 0 0 

GCTTATTGGG AACAGTTGAT TAACTGTAAT AGAAATGCAA TACAAATAAA ATGTGAACCA 156 0 

CATGTGATTT TTCTTTAAAA TCAGTGAGAT TTGAAAATTC TCCTAGATCT CTTGAATCAT 162 0 

GCAAATTTGC TTTGCCTTTA TATTGTAACC CTTGTGGGTT GCTAATAACC AAGCAGTTTG 1680 

TAGTAGAGTT AACTCAGGCT CGTTCTAGGG ACTCATTCAT GTTCACTCAC TGTACACTCA 174 0 

TCTCTGGAAA TGTAAAATTT ACTTTTATAC TATTGTTATG TAGGGCTGAC AGGACAACTG 1800 

GATCAGTTTC ATTAAAAAGG TATGTATGCA TTAGAAAAGA CATTTGTATG GGTCATTTCA 1860 

AAGAGGGCTT ATGAGGCTGT GAAACCCAGA GCTCTTAACG CTGTGACCAA .AGATGGAAGT 192 0 

TCTCTATAGG AAGCCATAGC ACTCCTAATG TTTGGTGCTA TGTTTTCCTG AGGAGATATA 1980 

AAACGTAATA ATCCATGATT GTTGCCATGT GAGAGTTTTA AAGGTTAATC AAAATTTCTC 2 04 0 

TTCTTCAGGG CAAACTTGAA GATAAATCTT TTGACTCCAG CTCTTTAGAG GATCTAAAGT 210 0 

GACCTTGATG GACAGTGGAA GAAATCACAA CATGGAATTC CTCGAATAAC AATTTATTGA 2160 

CTTTAAATAA TTTTGTCTAA TGCTACATAT ACACAATTAA AAAACCTTTA CACTATTTCT 2 22 0 

AGAAAGTCAG CATGTATTTT TGGCTCGAAG TTTCTCTAGT GTTTTCTGTG GAAGGAATAA 2 280 
AAATTTGAGT TTCAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAA 



SEP ID NO: 28 Ocq ID NO ; — Protein sequence: 
Protein Accession #: BAB15628.1 

1 11 21 31 41 51 

I I I I I I 

MDKSGIDSLD HVTSDAVELA NRSDNSSDSS LFKTQCIPYS PKGEKRNPIR KFVRTPESVH 60 
ASDSSSDSSF EPIPLTIKAI FERFKNRKKR YKKKKKRRYQ PTGRPRGRPE GRRNPIYSLI 120 
DKKKQFRSRG SGFPFLESEN EKNAPWRKIL TFEQAVARGF FNYIEKLKYE HHLKESLKQM 180 
NVGEDLENED FDSRRYKFLD DDGSISPIEE STAEDEDATH LEDNECDIKL AGDSFIVSSE 24 0 
FPVRLSVYLE EEDITEEAAL SKKRATKAKN TGQRGLKM 



SEP ID NO: 29 Ccq ID NO; 2D DNA sequence 
Nucleic Acid Accession #: NM_004289.3 
Coding sequence: 493-1695 

1 11 21 31 41 51 

I I I I I I 

GCCGCCGCCT CGTCCACCGG AGGAGCCGGC GCCAGCGTGG ACGGCGGCAG CCAGGCTGTG 60 
CAGGGGGGCG GCGGGGACCC CCGAGCGGCT CGGAGTGGCC CCTTGGACGC CGGGGAAGAG 120 
GAGAAGGCAC CCGCGGAACC GACGGCTCAG GTGCCGGACG CTGGCGGATG TGCGAGCGAG 180 
GAGAATGGGG TACTAAGAGA AAAGCACGAA GCTGTGGATC ATAGTTCCCA GCATGAGGAA 24 0 



AATGAAGAAA GGGTGTCAGC CCAGAAGGAG AACTCACTTC AGCAGAATGA TGATGATGAA 3 00 

AACAAAATAG CAGAGAAACC TGACTGGGAG GCAGAAAAGA CCACTGAATC TAGAAATGAG 360 

AGACATCTGA ATGGGACAGA TACTTCTTTC TCTCTGGAAG ACTTATTCCA GTTGCTTTCA 420 

TCACAGCCTG AAAATTCACT GGAGGGCATC TCATTGGGAG ATATTCCTCT TCCAGGCAGT 4 80 

ATCAGTGATG GCATGAATTC TTCAGCACAT TATCATGTAA ACTTCAGCCA GGCTATAAGT 54 0 

CAGGATGTGA ATCTTCATGA GGCCATCTTG CTTTGTCCCA ACAATACATT TAGAAGAGAT 600 

CCAACAGCAA GGACTTCACA GTCACAAGAA CCATTTCTGC AGTTAAATTC TCATACCACC 660 

AATCCTGAGC AAACCCTTCC TGGAACTAAT TTGACAGGAT TTCTTTCACC GGTTGACAAT 720 

CATATGAGGA ATCTAACAAG CCAAGACCTA CTGTATGACC TTGACATAAA TATATTTGAT 7 80 

GAGATAAACT TAATGTCATT GGCCACAGAA GACAACTTTG ATCCAATCGA TGTTTCTCAG 84 0 

CTTTTTGATG AACCAGATTC TGATTCTGGC CTTTCTTTAG ATTCAAGTCA CAATAATACC 900 

TCTGTCATCA AGTCTAATTC CTCTCACTCT GTGTGTGATG AAGGTGCTAT AGGTTATTGC 960 

ACTGACCATG AATCTAGTTC CCATCATGAC TTAGAAGGTG CTGTAGGTGG CTACTACCCA 102 0 

GAACCCAGTA AGCTTTGTCA CTTGGATCAA AGTGATTCTG ATTTCCATGG AGATCTTACA 1080 

TTTCAACACG TATTTCATAA CCACACTTAC CACTTACAGC CAACTGCACC AGAATCTACT 114 0 

TCTGAACCTT TTCCGTGGCC TGGGAAGTCA CAGAAGATAA GGAGTAGATA CCTTGAAGAC 12 00 

ACAGATAGAA ACTTGAGCCG TGATGAACAG CGTGCTAAAG CTTTGCATAT CCCTTTTTCT 12 60 

GTAGATGAAA TTGTCGGCAT GCCTGTTGAT TCTTTCAATA GCATGTTAAG TAGATATTAT 13 20 

CTGACAGACC TACAAGTCTC ACTTATCCGT GACATCAGAC GAAGAGGGAA AAATAAAGTT 13 80 

GCTGCGCAGA ACTGTCGTAA ACGCAAATTG GACATAATTT TGAATTTAGA AGATGATGTA 144 0 

TGTAACTTGC AAGCAAAGAA GGAAACTCTT AAGAGAGAGC AAGCACAATG TAACAAAGCT 1500 

ATTAACATAA TGAAACAGAA ACTGCATGAC CTTTATCATG ATATTTTTAG TAGATTAAGA 1560 

GATGACCAAG GTAGGCCAGT CAATCCCAAC CACTATGCTC TCCAGTGTAC CCATGATGGA 162 0 

AGTATCTTGA TAGTACCCAA AGAACTGGTG GCCTCAGGCC ACAAAAAGGA AACCCAAAAG 1680 

GGAAAGAGAA AGTGAGAAGA AACTGAAGAT GGACTCTATT ATGTGAAGTA GTAATGTTCA 1740 

GAAACTGATT ATTTGGATCA GAAACCATTG AAACTGCTTC AAGAATTGTA TCTTTAAGTA 1800 

CTGCTACTTG AATAACTCAG TTAACGCTGT TTTGAAGCTT ACATGGACAA ATGTTTAGGA 1860 

CTTCAAGATC ACACTTGTGG GCAATCTGGG GGAGCCACAA CTTTTCATGA AGTGCATTGT 192 0 

ATACAAAATT CATAGTTATG TCCAAAGAAT AGGTTAACAT GAAAACCCAG TAAGACTTTC 1980 

CATCTTGGCA GCCATCCTTT TTAAGAGTAA GTTGGTTACT TCAAAAAGAG CAAACACTGG 2 040 

GGATCAAATT ATTTTAAGAG GTATTTCAGT TTTAAATGCA AAATAGCCTT ATTTTCATTT 2100 

AGTTTGTTAG CACTATAGTG AGCTTTTCAA ACACTATTTT AATCTTTATA TTTAACTTAT 2160 
AAATTTTGCT TTCT 



SEP ID NO: 30 Ccq ID NO ; — 3-9- Protein sequence: 
Protein Accession #: NP_004280 

1 11 21 31 41 51 

I I I I I I 

MNSSAHYHVN FSQAISQDVN LHEAILLCPN NTFRRDPTAR TSQSQEPFLQ LNSHTTNPEQ 60 

TLPGTNLTGF LSPVDNHMRN LTSQDLLYDL DINIFDEINL MSLATEDNFD PIDVSQLFDE 120 

PDSDSGLSLD SSHNNTSVIK SNSSHSVCDE GAIGYCTDHE SSSHHDLEGA VGGYYPEPSK 180 

LCHLDQSDSD FHGDLTFQHV FHNHTYHLQP TAPESTSEPF PWPGKSQKIR SRYLEDTDRN 240 

LSRDEQRAKA LHIPFSVDEI VGMPVDSFNS MLSRYYLTDL QVSLIRDIRR RGKNKVAAQN 3 00 

CRKRKLDIIL NLEDDVCNLQ AKKETLKREQ AQCNKAINIM KQKLHDLYHD IFSRLRDDQG 3 60 

RPVNPNHYAL QCTHDGSILI VPKELVASGH KKETQKGKRK 



SEP ID NO: 31 Coq ID NO ; 31 DNA sequence 
Nucleic Acid Accession #: NM_033260.1 
Coding sequence: 1-1208 

1 11 21 31 41 51 

I I i I I I 

ATGAAGTTGG AGGTGTTCGT CCCTCGCGCG GCCCACGGGG ACAAGCAGGG CAGTGACCTG 60 
GAGGGCGCGG GCGGCAGCGA CGCGCCGTCC CCGCTGTCGG CGGCGGGAGA CGACTCCCTG 12 0 
GGCTCAGATG GGGACTGCGC GGCCAAGCCG TCCGCGGGCG GCGGCGCCAG AGATACGCAG 180 
GGCGACGGCG AACAGAGTGC GGGAGGCGGG CCGGGCGCGG AGGAGGCGAT CCCGGCAGCA 240 
GCTGCTGCAG CGGTGGTGGC GGAGGGCGCG GAGGCCGGGG CGGCGGGGCC AGGCGCGGGC 3 00 
GGCGCGGGGA GCGGCGAGGG TGCACGCAGC AAGCCATATA CGCGGCGGCC CAAGCCCCCC 3 60 



TACTCGTACA TCGCGCTCAT CGCCATGGCC ATCCGCGACT CGGCGGGCGG GCGCTTGACG 420 

CTGGCGGAGA TCAACGAGTA CCTCATGGGC AAGTTCCCCT TTTTCCGCGG CAGCTACACG 4 80 

GGCTGGCGCA ACTCCGTGCG CCACAACCTT TCGCTCAACG ACTGCTTCGT CAAGGTGCTG 54 0 

CGCGACCCCT CGCGGCCCTG GGGCAAGGAC AACTACTGGA TGCTCAACCC CAACAGCGAG 6 00 

TACACCTTCG CCGACGGGGT CTTCCGCCGC CGCCGCAAGC GCCTCAGCCA CCGCGCGCCG 660 

GTCCCCGCGC CCGGGCTGCG GCCCGAGGAG GCCCCGGGCC TCCCCGCCGC CCCGCCGCCC 72 0 

GCGCCCGCCG CCCCGGCCTC GCCCCGCATG CGCTCGCCCG CCCGCCAGGA GGAGCGCGCC 780 

AGCCCCGCGG GCAAGTTCTC CAGCTCCTTC GCCATCGACA GCATCCTGCG CAAGCCCTTC 84 0 

CGCAGCCGTC GCCTCAGGGA CACGGCCCCC GGGACGACGC TTCAGTGGGG CGCCGCGCCC 900 

TGCCCGCCGC TGCCCGCGTT CCCCGCGCTC CTCCCCGCGG CGCCCTGCAG GGCCCTGCTG 960 

CCGCTCTGCG CGTACGGCGC GGGCGAGCCG GCGCGGCTGG GCGCGCGCGA GGCCGAGGTG 1020 

CCACCGACCG CGCCGCCCCT CCTGCTTGCA CCTCTCCCGG CGGCGGCCCC CGCCAAGCCA 1080 

CTCCGAGGCC CGGCGGGCGG CGGCGCGCAC CTGTACTGCC CCCTGCGGCT GCCCGCAGCC 114 0 

CTGCAGGCGG CCTTAGTCCG NCGTCCTGGC CCGCACCTGT CGTACCCGGT GGAGACGCTC 12 00 
CTAGCTTGA 



SEP ID NO: 32 Coq ID NO; — 3^ Protein sequence: 
Protein Accession #: NP_150285.1 

1 11 21 31 41 51 

I I I I I I 

MKLEVFVPRA AHGDKQGSDL EGAGGSDAPS PLSAAGDDSL GSDGDCAAKP SAGGGARDTQ 60 
GDGEQSAGGG PGAEEAIPAA AAAAWAEGA EAGAAGPGAG GAGSGEGARS KPYTRRPKPP 12 0 
YSYIALIAMA IRDSAGGRLT LAEINEYLMG KFPFFRGSYT GWRNSVRHNL SLNDCFVKVL 180 
RDPSRPWGKD NYWMLNPNSE YTFADGVFRR RRKRLSHRAP VPAPGLRPEE APGLPAAPPP 24 0 
APAAPASPRM RSPARQEERA SPAGKFSSSF AIDSILRKPF RSRRLRDTAP GTTLQWGAAP 300 
CPPLPAFPAL LPAAPCRALL PLCAYGAGEP ARLGAREAEV PPTAPPLLLA PDPAAAPAKP 3 60 
LRGPAAGGAH LYCPLRLPAA LQAALVRRPG PHLSYPVETL LA 



SEP ID NO: 33 Ccq ID NO ; 33 DNA sequence 
Nucleic Acid Accession #: NM_012128.2 
Coding sequence: 43-27 96 

1 11 21 31 41 51 

I I I I I I 

GAACAAACCA ACATTTGAGC CAGGAATAAC TAGAGAGGAA CAATGGGGTT ATTCAGAGGT 60 

TTTGTTTTCC TCTTAGTTCT GTGCCTGCTG CACCAGTCAA ATACTTCCTT CATTAAGCTG 12 0 

AATAATAATG GCTTTGAAGA TATTGTCATT GTTATAGATC CTAGTGTGCC AGAAGATGAA 180 

AAAATAATTG AACAAATAGA GGATATGGTG ACTACAGCTT CTACGTACCT GTTTGAAGCC 240 

ACAGAAAAAA GATTTTTTTT CAAAAATGTA TCTATATTAA TTCCTGAGAA TTGGAAGGAA 300 

AATCCTCAGT ACAAAAGGCC AAAACATGAA AACCATAAAC ATGCTGATGT TATAGTTGCA 3 60 

CCACCTACAC TCCCAGGTAG AGATGAACCA TACACCAAGC AGTTCACAGA ATGTGGAGAG 42 0 

AAAGGCGAAT ACATTCACTT CACCCCTGAC CTTCTACTTG GAAAAAAACA AAATGAATAT 4 80 

GGACCACCAG GCAAACTGTT TGTCCATGAG TGGGCTCACC TCCGGTGGGG AGTGTTTGAT 540 

GAGTACAATG AAGATCAGCC TTTCTACCGT GCTAAGTCAA AAAAAATCGA AGCAACAAGG 600 

TGTTCCGCAG GTATCTCTGG TAGAAATAGA GTTTATAAGT GTCAAGGAGG CAGCTGTCTT 660 

AGTAGAGCAT GCAGAATTGA TTCTACAACA AAACTGTATG GAAAAGATTG TCAATTCTTT 72 0 

CCTGATAAAG TACAAACAGA AAAAGCATCC ATAATGTTTA TGCAAAGTAT TGATTCTGTT 780 

GTTGAATTTT GTAACGAAAA AACCCATAAT CAAGAAGCTC CAAGCCTACA AAACATAAAG 840 

TGCAATTTTA GAAGTACATG GGAGGTGATT AGCAATTCTG AGGATTTTAA AAACACCATA 900 

CCCATGGTGA CACCACCTCC TCCACCTGTC TTCTCATTGC TGAAGATCCG TCAAAGAATT 960 

GTGTGCTTAG TTCTTGATAA GTCTGGAAGC ATGGGGGGTA AGGACCGCCT AAATCGAATG 102 0 

AATCAAGCAG CAAAACATTT CCTGCTGCAG ACTGTTGAAA ATGGATCCTG GGTGGGGATG 1080 

GTTCACTTTG ATAGTACTGC CACTATTGTA AATAAGCTAA TCCAAATAAA AAGCAGTGAT 1140 

GAAAGAAACA CACTCATGGC AGGATTACCT ACATATCCTC TGGGAGGAAC TTCCATCTGC 12 00 

TCTGGAATTA AATATGCATT TCAGGTGATT GGAGAGCTAC ATTCCCAACT CGATGGATCC 12 60 

GAAGTACTGC TGCTGACTGA TGGGGAGGAT AACACTGCAA GTTCTTGTAT TGATGAAGTG 132 0 

AAACAAAGTG GGGCCATTGT TCATTTTATT GCTTTGGGAA GAGCTGCTGA TGAAGCAGTA 1380 

ATAGAGATGA GCAAGATAAC AGGAGGAAGT CATTTTTATG TTTCAGATGA AGCTCAGAAC 144 0 



AATGGCCTCA TTGATGCTTT TGGGGCTCTT 
TCCCTTCAGC TCGAAAGTAA GGGATTAACA 
GTCATAATTG ATAGTACAGT GGGAAAGGAC 
CCTCCCAGTA TTTCTCTCTG GGATCCCAGT 
GCAACTTCCA AAATGGCCTA TCTCAGTATT 
TACAATCTTC AAGCCAAAGC GAACCCAGAA 
GCAAATTCTT CTGTGCCTCC AATCACAGTG 
TTCCCCAGCC CAATGATTGT TTACGCAGAA 
GCCAATGTGA CTGCTTTCAT TGAATCACAG 
GATAATGGTG CAGGCGCTGA TTCTTTCAAG 
GCATATACAG AAAATGGCAG ATATAGCTTA 
GCCAGGCTAA AATTACGGCC TCCACTGAAT 
AACGGGGAAA TTGAAGCAAA CCCGCCAAGA 
TTGGAGGATT TCAGCCGAAC AGCATCCGGA 
CTTCCCTTGC CTGACCAATA CCCACCAAGT 
GAGGATAAGA TTATTCTTAC ATGGACAGCA 
CAACGTTATA TCATAAGAAT AAGTGCAAGT 
GCTCTTCAAG TAAATACTAC TGATCTGTCA 
GCATTTAAAC CAGAAAATAT CTCAGAAGAA 
AGTATAGATA AAAGCAATTT GACATCAAAA 
ATCCCTCAAG CAAATCCTGA TGACATTGAT 
AAAAGTCATA ATTCTGGAGT TAATATTTCT 
GTAATTGTTA ACTTTATTTT AAGTACCACC 
AAGTAGACCT AGAAGAGAGT TTTAAAAAAC 
TCTTAAAATT CATCCCATGT GTGATCATAA 
AAGGATACTT TGATTAAATA AAAACACTCA 
TTAATAGTTT CATTTATTTG TTATTTTATT 
TTTCATACTG ATACCTGGTT GTATATTATT 
AAATTGCATC AAGAAATTAA AATCATCTAT 
CAAATAAACA ACATTTGGAA AAAAAAAAAA 



ACATCAGGAA ATACTGATCT CTCCCAGAAG 
CTGAATAGTA ATGCCTGGAT GAACGACACT 
ACGTTCTTTC TCATCACATG GAACAGTCTG 
GGAACAATAA TGGAAAATTT CACAGTGGAT 
CCAGGAACTG CAAAGGTGGG CACTTGGGCA 
ACATTAACTA TTACAGTAAC TTCTCGAGCA 
AATGCTAAAA TGAATAAGGA CGTAAACAGT 
ATTCTACAAG GATATGTACC TGTTCTTGGA 
AATGGACATA CAGAAGTTTT GGAACTTTTG 
AATGATGGAG TCTACTCCAG GTATTTTACA 
AAAGTTCGGG CTCATGGAGG AGCAAACACT 
AGAGCCGCGT ACATACCAGG CTGGGTAGTG 
CCTGAAATTG ATGAGGATAC TCAGACCACC 
GGTGCATTTG TGGTATCACA AGTCCCAAGC 
CAAATCACAG ACCTTGATGC CACAGTTCAT 
CCAGGAGATA ATTTTGATGT TGGAAAAGTT 
ATTCTTGATC TAAGAGACAG TTTTGATGAT 
CCAAAGGAGG CCAACTCCAA GGAAAGCTTT 
AATGCAACCC ACATATTTAT TGCCATTAAA 
GTATCCAACA TTGCACAAGT AACTTTGTTT 
CCTACTCCTA CTCCTACTCC TACTCCTGAT 
ACGCTGGTAT TGTCTGTGAT TGGGTCTGTT 
ATTTGAACCT TAACGAAGAA AAAAATCTTC 
AAAACAATGT AAGTAAAGGA TATTTCTGAA 
ACTCATAAAA AT/^TTTTAA GATGTCGGAA 
TGGATATGTA AAAACTGTCA AGATTAAAAT 
TGTAAGAAAT AGTGATGAAC AAAGATCCTT 
TGATGCAACA GTTTTCTGAA ATGATATTTC 
CTGAGTAGTC AAAATACAAG TAAAGGAGAG 
AAAAAAAA 



SEP ID NO: 34 Ccq ID NO ! 31 Protein sequence: 
Protein Accession #: NP__036260,1 

1 11 21 31 41 51 

I I I I I I 

MGLFRGFVFL LVLCLLHQSN TSFIKLNNNG FEDIVIVIDP SVPEDEKIIE QIEDMVTTAS 

TYLFEATEKR FFFKNVSILI PENWKENPQY KRPKHENHKH ADVIVAPPTL PGRDEPYTKQ 

FTECGEKGEY IHFTPDLLLG KKQNEYGPPG KLFVHEWAHL RWGVFDEYNE DQPFYRAKSK 

KIEATRCSAG ISGRNRVYKC QGGSCLSRAC RIDSTTKLYG KDCQFFPDKV QTEKASIMFM 

QSIDSWEFC NEKTHNQEAP SLQNIKCNFR STWEVISNSE DFKNTIPMVT PPPPPVFSLL 

KIRQRIVCLV LDKSGSMGGK DRLNRMNQAA KHFLLQTVEN GSWVGMVHFD STATIVNKLI 

QIKSSDERNT LMAGLPTYPL GGTSICSGIK YAFQVIGELH SQLDGSEVLL LTDGEDNTAS 

SCIDEVKQSG AIVHFIALGR AADEAVIEMS KITGGSHFYV SDEAQNNGLI DAFGALTSGN 

TDLSQKSLQL ESKGLTLNSN AWMNDTVIID STVGKDTFFL ITWNSLPPSI SLWDPSGTIM 

ENFTVDATSK MAYLSIPGTA KVGTWAYNLQ AKANPETLTI TVTSRAANSS VPPITVNAKM 

NKDVNSFPSP MIVYAEILQG YVPVLGANVT AFIESQNGHT EVLELLDNGA GADSFKNDGV 

YSRYFTAYTE NGRYSLKVRA HGGANTARLK LRPPLNRAAY IPGWWNGEI EANPPRPEID 

EDTQTTLEDF SRTASGGAFV VSQVPSLPLP DQYPPSQITD LDATVHEDKI ILTWTAPGDN 

FDVGKVQRYI IRISASILDL RDSFDDALQV NTTDLSPKEA NSKESFAFKP ENISEENATH 

IFIAIKSIDK SNLTSKVSNI AQVTLFIPQA NPDDIDPTPT PTPTPDKSHN SGVNISTLVL 

SVIGSWIVN FILSTTI 



1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 



SEP ID NO: 35 Ccq ID NO: 3 5 DNA sequence 
Nucleic Acid Accession #: NM_000901.1 
Coding sequence: 217-3171 

1 11 21 31 41 51 

I I I i I I 

CGCGGGAGCC AACTTCAGGC TGCTCAGAGG AAGCCCGTGC AGTCAGTCAC CTGGGTGCAA 60 

GAGCGTTGCT GCCTCGGGCT CTCCCGCTGC AGGGAGAGCG GCACTCGCTG GCCTGGATGT 120 

GGTTGGATTT AGGGGGGCTC CGCAGCAGGG GTTTCGTGGC GGTGGCAAGC GCTGCAACAG 180 

GTAGACGGCG AGAGACGGAC CCCGGCCGAG GCAGGGATGG AGACCAAAGG CTACCACAGT 24 0 

CTCCCTGAAG GTCTAGATAT GGAAAGACGG TGGGGTCAAG TTTCTCAGGC TGTGGAGCGT 300 

TCTTCCCTGG GACCTACAGA GAGGACCGAT GAGAATAACT ACATGGAGAT TGTCAACGTA 360 

AGCTGTGTTT CCGGTGCTAT TCCAAACAAC AGTACTCAAG GAAGCAGCAA AGAAAAACAA 4 20 

GAACTACTCC CTTGCCTTCA GCAAGACAAT AATCGGCCTG GGATTTTAAC ATCTGATATT 4 80 

AAAACTGAGC TGGAATCTAA GGAACTTTCA GCAACTGTAG CTGAGTCCAT GGGTTTATAT 54 0 

ATGGATTCTG TAAGAGATGC TGACTATTCC TATGAGCAGC AGAACCAACA AGGAAGCATG 600 

AGTCCAGCTA AGATTTATCA GAATGTTGAA CAGCTGGTGA AATTTTACAA AGGAAATGGC 660 

CATCGTCCTT CCACTCTAAG TTGTGTGAAC ACGCCCTTGA GATCATTTAT GTCTGACTCT 720 

GGGAGCTCCG TGAATGGTGG CGTCATGCGC GCCATTGTTA AAAGCCCTAT CATGTGTCAT 7 80 

GAGAAAAGCC CGTCTGTTTG CAGCCCTCTG AACATGACAT CTTCGGTTTG CAGCCCTGCT 840 

GGAATCAACT CTGTGTCCTC CACCACAGCC AGCTTTGGCA GTTTTCCAGT GCACAGCCCA 900 

ATCACCCAGG GAACTCCTCT GACATGCTCC CCTAATGCTG AAAATCGAGG CTCCAGGTCG 960 

CACAGCCCTG CACATGCTAG CAATGTGGGC TCTCCTCTCT CAAGTCCGTT AAGTAGCATG 1020 

AAATCCTCAA TTTCCAGCCC TCCAAGTCAC TGCAGTGTAA AATCTCCAGT CTCCAGTCCC 10 80 

AATAATGTCA CTCTGAGATC CTCTGTGTCT AGCCCTGCAA ATATTAACAA CTCAAGGTGC 1140 

TCTGTTTCCA GCCCTTCGAA CACTAATAAC AGATCCACGC TTTCCAGTCC GGCAGCCAGT 1200 

ACTGTGGGAT CTATCTGTAG CCCTGTAAAC AATGCCTTCA GCTACACTGC TTCTGGCACC 1260 

TCTGCTGGAT CCAGTACATT GCGGGATGTG GTTCCCAGTC CAGACACGCA GGAGAAAGGT 132 0 

GCTCAAGAGG TCCCTTTTCC TAAGACTGAG GAAGTAGAGA GTGCCATCTC AAATGGTGTG 13 80 

ACTGGCCAGC TTAATATTGT CCAGTACATA AAACCAGAAC CAGATGGAGC TTTTAGCAGC 144 0 

TCATGTCTAG GAGGAAATAG CAAAATAAAT TCGGATTCTT CATTCTCAGT ACCAATAAAG 1500 

CAAGAATCAA CCAAGCATTC ATGTTCAGGC ACCTCTTTTA AAGGGAATCC AACAGTAAAC 1560 

CCGTTTCCAT TTATGGATGG CTCGTATTTT TCCTTTATGG ATGATAAAGA CTATTATTCC 162 0 

CTATCAGGAA TTTTAGGACC ACCTGTGCCC GGCTTTGATG GTAACTGTGA AGGCAGCGGA 16 80 

TTCCCAGTGG GTATTAAACA AGAACCAGAT GACGGGAGCT ATTACCCAGA GGCCAGCATC 1740 

CCTTCCTCTG CTATTGTTGG GGTGAATTCA GGTGGACAGT CCTTCCACTA CAGGATTGGT 1800 

GCTCAAGGTA CAATATCTTT ATCACGATCG GCTAGAGACC AATCTTTCCA ACACCTGAGT 1860 

TCCTTTCCTC CTGTCAATAC TTTAGTGGAG TCATGGAAAT CACACGGCGA CCTGTCGTCT 192 0 

AGAAGAAGTG ATGGGTATCC GGTCTTAGAA TACATTCCAG AAAATGTATC AAGCTCTACT 1980 

TTACGAAGTG TTTCTACTGG ATCTTCAAGA CCTTCAAAAA TATGTTTGGT GTGTGGGGAT 2 040 

GAGGCTTCAG GATGCCATTA TGGGGTAGTC ACCTGTGGCA GCTGCAAAGT TTTCTTCAAA 2100 

AGAGCAGTGG AAGGGCAACA CAACTATTTA TGTGCTGGAA GAAATGATTG CATCATTGAT 2160 

AAGATTCGAC GAAAGAATTG TCCTGCTTGC AGACTTCAGA AATGTCTTCA AGCTGGAATG 2 22 0 

AATTTAGGAG CACGAAAGTC AAAGAAGTTG GGAAAGTTAA AAGGGATTCA CGAGGAGCAG 2280 

CCACAGCAGC AGCAGCCCCC ACCCCCACCC CCACCCCCGC AAAGCCCAGA GGAAGGGACA 2340 

ACGTACATCG CTCCTGCAAA AGAACCCTCG GTCAACACAG CACTGGTTCC TCAGCTCTCC 2400 

ACAATCTCAC GAGCGCTCAC ACCTTCCCCC GTTATGGTCC TTGAAAACAT TGAACCTGAA 2460 

ATTGTATATG CAGGCTATGA CAGCTCAAAA CCAGATACAG CCGAAAATCT GCTCTCCACG 2 52 0 

CTCAACCGCT TAGCAGGCAA ACAGATGATC CAAGTCGTGA AGTGGGCAAA GGTACTTCCA 2 580 

GGATTTAAAA ACTTGCCTCT TGAGGACCAA ATTACCCTAA TCCAGTATTC TTGGATGTGT 2 640 

CTATCATCAT TTGCCTTGAG CTGGAGATCG TACAAACATA CGAACAGCCA ATTTCTCTAT 2700 

TTTGCACCAG ACCTAGTCTT TAATGAAGAG AAGATGCATC AGTCTGCCAT GTATGAACTA 2 760 

TGCCAGGGGA TGCACCAAAT GAGGCTTCAG TTCGTTCGAC TGCAGCTCAC CTTTGAAGAA 2 82 0 

TACACCATCA TGAAAGTTTT GCTGCTACTA AGCACAATTC CAAAGGATGG CCTCAAAAGC 2 88 0 

CAGGCTGCAT TTGAAGAAAT GAGGACAAAT TACATCAAAG AACTGAGGAA GATGGTAACT 2 94 0 

AAGTGTCCCA ACAATTCTGG GCAGAGCTGG CAGAGGTTCT ACCAACTGAC CAAGCTGCTG 3000 

GACTCCATGC ATGACCTGGT GAGCGACCTG CTGGAATTCT GCTTCTACAC CTTCCGAGAG 3 060 

TCCCATGCGC TGAAGGTAGA GTTCCCCGCA ATGCTGGTGG AGATCATCAG CGACCAGCTG 312 0 

CCCAAGGTGG AGTCGGGGAA CGCCAAGCCG CTCTACTTCC ACCGGAAGTG ACTGCCCGCT 318 0 

GCCCAGAAGA ACTTTGCCTT AAGTTTCCCT GTGTTGTTCC ACACCCAGAA GGACCCAAGA 3240 

AAACCTGTTT TTAACATGTG ATGGTTGATT CACACTTGTT CAACAGTTTC TCAAGTTTAA 3300 



AGTCATGTCA GAGGTTTGGA GCCGGGAAAG CTGTTTTTCC GTGGATTTGG CGAGACCAGA 33 60 

GCAGTCTGAA GGATTCCCCA CCTCCAATCC CCCAGCGCTT AGAAACATGT TCCTGTTCCT 3420 

CGGGATGAAA AGCCATATCT AGTCAATAAC TCTGATTTTG ATATTTTCAC AGATGGAAGA 34 80 

AGTTTTAACT ATGCCGTGTA GTTTCTGGTA TCGTTCGCTT GTTTTAAAAG GGTTCAAGGA 3 54 0 

CTAACGAACG TTTTAAAGCT TACCCTTGGT TTGCACATAA AACGTATAGT CAATATGGGG 360 0 

CATTAATATT CTTTTGTTAT TAAAAAAACA CAAAAAAATA ATAAAAAAAT ATATACAGAT 3660 

TCCTGTTGTG TAATAACAGA ACTCGTGGCG TGGGGCAGCA GCTGCCTCTG AGCCCTCGCT 372 0 

CGTCCACGGT CTTCTGCATC ACTGGTATAC ACACTCGTTA GCGTCCATTT CTTATTTAAT 3 780 

TAGAATGGAT AAGATGATGT TAAATGCCTT GGTTTGATTT CTAGTATCTA TTGTGTTGGC 3 84 0 

TTTACAAATA ATTTTTTGCA GTCTTTTGCT GTGCTGTACA TTACTGTATG TATAAATTAT 3 900 

GAAGGACCTG AAATAAGGTA TAAGGATCTT TTGTAAATGA GACACATACA AAAAAAATCT 3 960 

TTAATGGTTA ATAGGATGAA TGGGAAAGTA TTTTTGAAAG AATTCTATTT TGCTGGAGAC 4 02 0 

TATTTAAGTA CTATCTTTGT CTAAACAAGG TAATTTTTTT TTGTAAAGTG CAATGTCCTG 4 0 80 

CATGCATAAT GAACCGTTTA CAGTGTATTT AAGAAAGGGA AAGCTGTGCC TTTTTTAGCT 414 0 

TCATATCTAA TTTACCATTA TTTTACAGTC TCTGTTGTAA ATAACCACAC TGAAACCTCT 42 00 

TCGGTTGTCT TGAAACCTTT CTACTTTTTC TGTACTTTTT GTTTTGTTCT TGGTCTCCCG 4260 

CTTGGGGCAT TTGTGGGACT CCAGCACGTT TTCTGGCTTC TGCTTCATCC TGCTCCATCG 4320 

GGGAATGACA CACTGCGGTG TCTGCAGCTC CTGGAAGGTG TCATTTGACA ACACATGTGG 43 80 

GAGAGGAGGT CCTTGGAGTG CTGCAGCTTT GGGAAAGCCT GCCTCGTTTC CCTTTTCCTC 4440 

TAGAAGCAGA ACCAGCTCTA CGAGAGTGAG ACTGGGAACT TGATGGCTCA GAGAGCATCT 450 0 

TTTCCTCCCA TTTTAGAAAA TCAGATTTTC TCCTGTGGGA AAAAAAAATT CCATGCACTC 4560 

TCTCTCTGTT AAAGATCAGC TATTCCCTTC TGATCTTGGA AAGAGGTTCT GCACTCCTGG 462 0 

AACCGGTCAC AGGAACGCAC AGATCATGGC AGGATGCGCT GGGACGGCCC ATCTTGGCAA 468 0 

GGTTCAGTCT GAATGGCATG GAGACCGGGA GATAGAGGGG TTTTAGATTT TTAAAAGGTA 474 0 

GGTTTTAAAA ATAAGTTTTA TACATAAACA GTTTTGGAGA AAAATTACAG ATCATATAAG 4800 

CAAGACAGTG GCACTAAAAT GTTTAATTCA TTAATCTGTT TGTTTGGCAC TGATGCAATG 4860 

TATGGCTTTT CTCTTGCCCC AAATCACAAA CATATGTATC TTTGGGGAAA CTAACAATAT 4 92 0 

GATTGCACTA AATAAACTAC TTTGAATAGA GGCCAAATTA ATCTTTTAAA AATGATGATA 498 0 

ATCATCAGGT TTACTCAGTG AAATCATATT AATTATTTTC CAAAATCTAA AAGCTGTAGC 504 0 

TGGAGAAGCC CATGGCCACG AGGAAGCAGC AATTAATTAG ATCAACACTT TTCTCCAGGG 5100 

TTCACCATGC AGGCAACATT ACCTTGTCTT TCAAAAGACA CCTGCCTTAG TGCAAGGGGA 5160 

T^CCTGTGAA AGCTGCACTC AGAGGGAGGA GTCTTTCTTA CATAATTTGC AATTTCAGGA 522 0 

ATTTAATTTA TAGGCAGATC TTTAAATACA GTCAACTTAC GGTGCACAGT AATATGAAAG 52 8 0 

CCACACTTTG AAGGTAATAA ATACACAGCA TGCAGACTGG GAGTTGCTAG CAAACAAATG 534 0 

GCTTACTTAC AAAAGCAGCT TTTAGTTCAG ACTTAGTTTT TATAAAATGA GAATTCTGAC 5400 

TTACTTAACC AGGTTTGGGA TGGAGATGGT CTGCATCAGC TTTTTGTATT AACAAAGTTA 5460 

CTGGCTCTTT GTGTGTCTCC AGGTAACTTT GCTTGATTAA ACAGCAAAGC CATATTCTAA 552 0 

ATTCACTGTT GAATGCCTGT CCCAGTCCAA ATTGTCTGTC TGCTCTTATT TTTGTACCAT 5580 

ATTGCTCTTA AAAATCTTGG TTTGGTACAG TTCATAATTC ACCAAAAAGT TCATATAATT 564 0 

TAAAGAAACA CTAAATTAGT TTAAAATGAA GCAATTTATA TCTTTATGCA AAAACATATG 5700 
TCTGTCTTTG CAAAGGACTG TAAGCAGATT ACAATAAATC CTTTACTTT 



SEP ID NO: 36 Ccq ID NO: 3 6 Protein sequence : 
Protein Accession #: NP_000892,1 

1 11 21 31 41 51 

I I I I i I 

METKGYHSLP EGLDMERRWG QVSQAVERSS LGPTERTDEN NYMEIVNVSC VSGAIPNNST 60 

QGSSKEKQEL LPCLQQDNNR PGILTSDIKT ELESKELSAT VAESMGLYMD SVRDADYSYE 120 

QQNQQGSMSP AKIYQNVEQL VKFYKGNGHR PSTLSCVNTP LRSFMSDSGS SVNGGVMRAI 180 

VKSPIMCHEK SPSVCSPLNM TSSVCSPAGI NSVSSTTASF GSFPVHSPIT QGTPLTCSPN 24 0 

AENRGSRSHS PAHASNVGSP LSSPLSSMKS SISSPPSHCS VKSPVSSPNN VTLRSSVSSP 300 

ANINNSRCSV SSPSNTNNRS TLSSPAASTV GSICSPVNNA FSYTASGTSA GSSTLRDWP 360 

SPDTQEKGAQ EVPFPKTEEV ESAISNGVTG QLNIVQYIKP EPDGAFSSSC LGGNSKINSD 420 

SSFSVPIKQE STKHSCSGTS FKGNPTVNPF PFMDGSYFSF MDDKDYYSLS GILGPPVPGF 480 

DGNCEGSGFP VGIKQEPDDG SYYPEASIPS SAIVGVNSGG QSFHYRIGAQ GTISLSRSAR 540 

DQSFQHLSSF PPVNTLVESW KSHGDLSSRR SDGYPVLEYI PENVSSSTLR SVSTGSSRPS 600 

KICLVCGDEA SGCHYGWTC GSCKVFFKRA VEGQHNYLCA GRNDCIIDKI RRKNCPACRL 660 

QKCLQAGMNL GARKSKKLGK LKGIHEEQPQ QQQPPPPPPP PQSPEEGTTY lAPAKEPSVN 720 

TALVPQLSTI SRALTPSPVM VLENIEPEIV YAGYDSSKPD TAENLLSTLN RLAGKQMIQV 7 80 



VKWAKVLPGF KNLPLEDQIT LIQYSWMCLS SFALSWRSYK HTNSQFLYFA PDLVFNEEKM 84 0 

HQSAMYELCQ GMHQISLQFV RLQLTFEEYT IMKVLLLLST IPKDGLKSQA AFEEMRTNYI 900 

KELRKMVTKC PKFNSGQSWQR FYQLTKLLDS MHDLVSDLLE FCFYTFRESH ALKVEFPAML 960 
VEIISDQLPK VESGNAKPLY FHRK 

SEP ID NO: 37 Ccq ID NO i 3 7 DNA sequence 

Nucleic Acid Accession #: see Table 25 & 25A for complete list 

1 11 21 31 41 51 

I I I I I I 

CCTACCAGGT TCAAGCAACT CTGCTGCCTC AGCTCCCAAG TAGCTGGGAT TACAGGTGCA 60 

TGCCACTACA CCTGGCTTTT TGTATTTTTA GTAGAGATGG TTTTCACTAT GTTGGCCAGG 12 0 

CTGATCTTGA ATTCCTGGCC TGAAGTAATC TGCCTGCCTC AGCCTCCCAA AGTGCTGGGA 180 

TTATAGGAGC CACCACACCT GGCATAACTG GTATTTTTTA TATGCTTCCT GGGCAACTTA 240 

AAAAATTGAT TACTCTGTTG TTTCTTCCTT tTTTTTTTTT TTTTGGCTTT GACCAATTTG 300 

TGAGACCCAA GTATCTCCTA CCTAGAAAAA AAACACACTA AACAGTAAAT GATTACCAAC 360 

CTATTTGGAA CAAATCTCAA TTAATTAACA TATACTTCAA GGAGAAGACT TAACAAAATC 42 0 

TTACTTTTCA TTCTTAATAG CTCTTTCCAT AAAAATGTTC CACAAGTGTA TCAAATTAGT 4 80 

CCTAACAACT ACTGTTAAGT GATTAATGAA ACAGGAGTGA CAGGAGTGAA TTTAATAATA 540 

GCAATAAATA CAGATGGGAC TACATAAATT GTGGAGGTCC TGATGCAAAA CTCTCTCTGT 600 

ATTCGATGGC ATCTCAGCTT TCTCATAGAG CTGTTTCACT GTGAGGGTCT TTATCCTTCA 660 

TGCAGAGCTT CATTATTTTC TTTCTTCTAG CAATCAGTCC AAAGCACAAT GTCAGAAAGA 72 0 

TCACAACACA TGCAGCAATA ATGGGCTCTA TTGGTACACC CACAGTTTTA TCTTTAACAA 7 80 
TC 

SEP ID NO: 38 Ccq ID NO; 3 8 DNA sequence 
Nucleic Acid Accession #: NM_001192.1 
Coding sequence: 219-773 

1 11 21 31 41 51 

I I I I I ' 

AAGACTCAAA CTTAGAAACT TGAATTAGAT GTGGTATTCA AATCCTTACG TGCCGCGAAG 60 

ACACAGACAG CCCCCGTAAG AACCCACGAA GCAGGCGAAG TTCATTGTTC TCAACATTCT 120 

AGCTGCTCTT GCTGCATTTG CTCTGGAATT CTTGTAGAGA TATTACTTGT CCTTCCAGGC 180 

TGTTCTTTCT GTAGCTCCCT TGTTTTCTTT TTGTGATCAT GTTGCAGATG GCTGGGCAGT 240 

GCTCCCAAAA TGAATATTTT GACAGTTTGT TGCATGCTTG CATACCTTGT CAACTTCGAT 3 00 

GTTCTTCTAA TACTCCTCCT CTAACATGTC AGCGTTATTG TAATGCAAGT GTGACCAATT 3 60 

CAGTGAAAGG AACGAATGCG ATTCTCTGGA CCTGTTTGGG ACTGAGCTTA ATAATTTCTT 420 

TGGCAGTTTT CGTGCTAATG TTTTTGCTAA GGAAGATAAG CTCTGAACCA TTAAAGGACG 4 80 

AGTTTAAAAA CACAGGATCA GGTCTCCTGG GCATGGCTAA CATTGACCTG GAAAAGAGCA 540 

GGACTGGTGA TGAAATTATT CTTCCGAGAG GCCTCGAGTA CACGGTGGAA GAATGCACCT 600 

GTGAAGACTG CATCAAGAGC AAACCGAAGG TCGACTCTGA CCATTGCTTT CCACTCCCAG 660 

CTATGGAGGA AGGCGCAACC ATTCTTGTCA CCACGAAAAC GAATGACTAT TGCAAGAGCC 720 

TGCCAGCTGC TTTGAGTGCT ACGGAGATAG AGAAATCAAT TTCTGCTAGG TAATTAACCA 780 

TTTCGACTCG AGCAGTGCCA CTTTAAAAAT CTTTTGTCAG AATAGATGAT GTGTCAGATC 840 

TCTTTAGGAT GACTGTATTT TTCAGTTGCC GATACAGCTT TTTGTCCTCT AACTGTGGAA 900 

ACTCTTTATG TTAGATATAT TTCTCTAGGT TACTGTTGGG AGCTTAATGG TAGAAACTTC 960 
CTTGGTTTCA TGATTAAAGT CTTTTTTTTT CCTGA 

SEP ID NO : 3 9 Ccq ID NO ; 39 Protein sequence: 
Protein Accession #: NP_001183.1 

1 11 21 31 41 51 

I I I I I I 

MLQMAGQCSQ NEYFDSLLHA CIPCQLRCSS NTPPLTCQRY CNASVTNSVK GTNAILWTCL 60 

GLSLIISLAV FVLMFLLRKI SSEPLKDEFK NTGSGLLGMA NIDLEKSRTG DEIILPRGLE 120 

YTVEECTCED CIKSKPKVDS DHCFPLPAME EGATILVTTK TNDYCKSLPA TUiSATEIEKS 180 
ISAR 



SEP ID NO: 40 Dcq ID NO ! 10 DNA sequence 
Nucleic Acid Accession #: NM_025087.1 
Coding sequence: 183-2282 

1 11 21 31 41 51 

I I I I I I 

ACACTGCCTC GGTTCGGCAA GTGGGTCAGT TGGCTGGGGC TCACTTGGCA ACGGGACGCG 60 

GGAACGAGGG GCGCGGACGC AGGCCCGGGA GGACGCGGCG GCGGGAACCT GGGGGCGCAG 120 

GGCTAGGGCA GCGGGCCCGA CCCGCACGGC TTTCCTGGAA AGCGCTGCCC CTCGCCGCGG 180 

CGATGACCTC GCTGTGGAGA GAAATCCTCT TGGAGTCGCT GCTGGGATGT GTTTCTTGGT 240 

CTCTCTACCA TGACCTGGGA CCGATGATCT ATTACTTTCC TTTGCAAACA CTAGAACTCA 300 

CTGGGCTTGA AGGTTTTAGT ATAGCATTTC TTTCTCCAAT ATTCCTAACA ATTACTCCTT 3 60 

TCTGGAAATT GGTTAACAAG AAGTGGATGC TAACCCTGCT GAGGATAATC ACTATTGGCA 420 

GCATAGCCTC CTTCCAGGCT CCAAATGCCA AACTTCGACT GATGGTTCTT GCGCTTGGGG 4 80 

TGTCTTCCTC ACTGATAGTG CAAGCTGTGA CTTGGTGGTC AGGAAGTCAT TTGCAAAGGT 54 0 

ACCTCAGAAT TTGGGGATTC ATTTTAGGAC AGATTGTTCT TGTTGTTCTA CGCATATGGT 600 

ATACTTCACT AAACCCAATC TGGAGTTATC AGATGTCCAA CAAAGTGATA CTGACATTAA 660 

GTGCCATAGC CACACTTGAT CGTATTGGCA CAGATGGTGA CTGCAGTAAA CCTGAAGAAA 720 

AGAAGACTGG TGAGGTAGCC ACGGGGATGG CCTCTAGACC CAACTGGCTG CTGGCAGGGG 780 

CTGCTTTTGG TAGCCTTGTG TTCCTCACCC ACTGGGTTTT TGGAGAAGTC TCTCTTGTTT 840 

CCAGATGGGC AGTGAGTGGG CATCCACATC CAGGGCCAGA TCCTAACCCA TTTGGAGGTG 900 

CAGTACTGCT GTGCTTGGCA AGTGGATTGA TGCTTCCATC TTGTTTGTGG TTTCGTGGTA 960 

CTGGTTTGAT CTGGTGGGTT ACAGGAACAG CTTCAGCTGC GGGGCTCCTT TACCTGCACA 102 0 

CATGGGCAGC TGCTGTGTCT GGCTGTGTCT TCGCCATCTT TACTGCATCC ATGTGGCCCC 1080 

AAACACTTGG ACACCTTATT AACTCAGGGA CAAACCCTGG GAAAACCATG ACCATTGCCA 1140 

TGATATTTTA TCTTCTAGAA ATATTTTTCT GTGCCTGGTG CACAGCTTTT AAGTTTGTCC 1200 

CAGGAGGTGT CTACGCTAGA GAAAGATCAG ATGTGCTTTT GGGGACAATG ATGTTAATTA 1260 

TCGGGCTGAA TATGCTATTT GGTCCTAAGA AAAACCTTGA TTTGCTTCTT CAAACAAAAA 132 0 

ACAGTTCTAA AGTGCTTTTC AGAAAGAGTG AAAAATACAT GAAACTTTTT CTGTGGCTGC 1380 

TTGTTGGTGT GGGATTGTTG GGATTAGGAC TACGGCATAA AGCCTATGAG AGAAAACTGG 1440 

GCAAAGTGGC ACCAACCAAA GAGGTCTCTG CTGCCATCTG GCCTTTCAGG TTTGGATATG 1500 

ACAATGAAGG GTGGTCTAGT CTAGAAAGAT CAGCTCACCT GCTCAATGAA ACAGGTGCAG 1560 

ATTTCATAAC AATTTTGGAG AGTGATGCTT CTAAGCCCTA TATGGGGAAC AATGACTTAA 162 0 

CCATGTGGCT AGGGGAAAAG TTGGGTTTCT ATACAGACTT TGGTCCAAGC ACAAGGTATC 1680 

ACACTTGGGG GATTATGGCT TTGTCAAGAT ACCCAATTGT GAAATCTGAG CATCACCTTC 1740 

TTCCGTCACC AGAGGGCGAG ATCGCACCAG CCATCACATT GACCGTTAAC ATTTCGGGCA 1800 

AGCTGGTGGA TTTTGTCGTG ACACACTTTG GGAACCACGA AGATGACCTC GACAGGAAAC 1860 

TGCAGGCTAT TGCTGTTTCA AAACTACTGA AAAGTAGCTC TAATCAAGTG ATATTTCTGG 192 0 

GATATATCAC TTCAGCACCT GGCTCCAGAG ATTATCTACA GCTCACTGAA CATGGCAATG 1980 

TGAAGGATAT CGACAGCACT GATCATGACA GATGGTGTGA ATACATTATG TATCGAGGGC 2040 

TGATCAGGTT GGGTTATGCA AGAATCTCCC ATGCTGAACT GAGTGATTCA GAAATTCAGA 2100 

TGGCAAAATT TAGGATCCCT GATGACCCCA CTAATTATAG AGACAACCAG AAAGTGGTCA 2160 

TAGACCACAG AGAAGTTTCT GAGAAAATTC ATTTTAATCC CAGATTTGGA TCCTACAAAG 222 0 

AAGGACACAA TTATGAAAAC AACCATAATT TTCATATGAA TACTCCCAAA TACTTTTTAT 2280 

GAAACATTTA AAACAAGAAG TTATTGGCTG GGAAAATCTA AGAAAAAAAG TATGTAAGAT 2340 

AAAAAGAAGA GATTAATGAA AGTGGGAAAA TACACATGAA GAACCTCAAC TTAAAAAACA 2400 

CATGGTATCT ATGCAGTGGG AAATTACCTC CATTTGTAAA CTATGTTGCT TAATAAAAAC 2460 
ATTTCTCTAA AAAAAAAAAA AAAAAA 



SEP ID N0:41 Ccq ID NO; 11 Protein sequence: 
Protein Accession #: NP_079363.1 

1 11 21 31 41 51 

I I I I I ' 

MTSLWREILL ESLLGCVSWS LYHDLGPMIY YFPLQTLELT GLEGFSIAFL SPIFLTITPF 60 

WKLVNKKWML TLLRIITIGS lASFQAPNAK LRLMVLALGV SSSLIVQAVT WWSGSHLQRY 120 

LRIWGFILGQ IVLWLRIWY TSLNPIWSYQ MSNKVILTLS AIATLDRIGT DGDCSKPEEK 180 

KTGEVATGMA SRPNWLLAGA AFGSLVFLTH WVFGEVSLVS RWAVSGHPHP GPDPNPFGGA 240 

VLLCLASGLM LPSCLWFRGT GLIWWVTGTA SAAGLLYLHT WAAAVSGCVF AIFTASMWPQ 300 

TLGHLINSGT NPGKTMTIAM IFYLLEIFFC AWCTAFKFVP GGVYARERSD VLLGTMMLII 360 



GliNMLFGPKK NLDLLLQTKN SSKVLFRKSE KYMKLFLWLL VGVGLLGLGL RHKAYERKLG 42 0 

KVAPTKEVSA AIWPFRFGYD NEGWSSLERS AHLLNETGAD FITILESDAS KPYMGNNDLT 4 80 

MWLGEKLGFY TDFGPSTRYH TWGIMALSRY PIVKSEHHLL PSPEGEIAPA ITLTVNISGK 54 0 

LVDFWTHFG NHEDDLDRKL QAIAVSKLLK SSSNQVIFLG YITSAPGSRD YLQLTEHGNV 600 

KDIDSTDHDR WCEYIMYRGL IRLGYARISH AELSDSEIQM AKFRIPDDPT NYRDNQKWI 660 
DHREVSEKIH FNPRFGSYKE GHNYENNHNF HMNTPKYFL 



